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INTRODUCTION 

Field of the Invention 

The field of the invention is phycobiliprotein-assisted expression and folding of proteins 
and the production of function-added recombinant phycobiliproteins. 

Background of the Invention 

Many foreign proteins expressed in bacteria form insoluble aggregates called inclusion 
bodies. Recovery of the recombinant protein of interest requires dissolving the inclusion bodies 
under denaturing conditions, removing the denaturant to allow the recombinant protein to fold, 
and finally purifying the recombinant protein. This process is laborious and frequently gives low 
yields. Inclusion bodies are formed because the aggregation of the newly synthesized 
recombinant polypeptide is faster than its folding into the native structure. In one aspect, the 
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invention provides bifunctional fusion constructs which are rapidly incorporated into a 
macromolecular structure such that the fusion proteins are separated from one another and unable 
to self-associate. 

In a more particular aspect, the macromolecular structures are oligomeric 
5 phycobiliproteins - a family of structurally related photosynthetic accessory proteins naturally 

found, inter alia, in cyanobacteria, the chloroplasts of the Rhodophyta (red algae) and in those of 
the Cryptophyceae. When they carry covalently attached linear tetrapyrrole prosthetic groups 
(bilins), these proteins can exhibit brilliant colors and intense fluorescence, making them 
valuable specific labeling reagents. Unfortunately, the diversity of such reagents has been 
10 restricted to naturally occurring phycobiliproteins and their target specificity generated by 
chemical conjugation, which generates heterogeneity (see, e.g. Siiman et al., 1999, Bioconj 
Chem, 10, 1090-1 106). The use of phycobiliproteins in the invention also overcomes these prior 
art limitations and provides homogenous, specific labeling reagents. 

15 SUMMARY OF THE INVENTION 

We have found that phycobiliprotein subunits are tolerant to terminal extensions of 
different sizes. A phycobiliprotein subunit domain in a fusion protein is able to assemble with a 
cognate partner subunit to form heterodimers, and frequently further assemble into even higher- 
order aggregates, and into the light-harvesting antenna complexes, phycobilisomes. The non- 
20 phycobiliprotein part of the fusion protein (the displayed domain) is exposed on the surface of 
the oligomeric phycobiliproteins. We define this fusion protein expression system as 
"phycobilisome display". Newly synthesized fusion polypeptides are quickly separated from 
each other by virtue of the assembly of the phycobiliprotein domain (the carrier domain) into 
oligomers. The non-phycobiliprotein moiety (the displayed domain) of the fusion protein is 
25 displayed on the surface of oligomer, e.g. the rods of the phycobilisome, and is able to fold into 
functional proteins while sequestered from interaction with other unfolded polypeptides. 
Phycobilisome display can therefore be used as an alternative folding environment for difficult- 
to-fold proteins, especially those that have been found difficult to fold in other organisms. The 
displayed protein can be used as a fusion construct bearing a fluorescent phycobiliprotein tag, or 
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be separated from the carrier phycobiliprotein domain by cleavage of the linker peptide between 
the carrier domain and the displayed domain by a specific protease. Either embodiment may be 
practiced in cells (e.g. using a resident protease) or in vitro (e.g. using purified fusion proteins, 
cell-free expression systems, etc.), though cleavage is preferably practiced outside the cell to 
5 control its timing with respect to folding. 

'Accordingly, the invention provides methods and compositions for displaying a 
functional polypeptide domain on an oligomeric phycobiliprotein. The compositions include 
fusion proteins comprising a functional displayed domain and a functional phycobiliprotein 
domain incorporated in a functional oligomeric phycobiliprotein. In particular embodiments, the 

10 phycobiliprotein domain is a natural phycobiliprotein domain or modified variant thereof; the 
functional oligomeric phycobiliprotein is an a, p heterodimer; the displayed domain comprises 
an affinity tag, an oligomerization moiety, a specific binding moiety and/or a signaling moiety; 
and/or the displayed domain is refractive to expression in E. coli: The compositions also include 
functional oligomeric phycobiliproteins comprising the subject fusion proteins and cells 

15 comprising such oligomeric phycobiliproteins. 

The subject methods include methods for making a functional displayed domain 
comprising the step of combining a polypeptide comprising a displayed domain and a 
phycobiliprotein domain with a phycobiliprotein subunit under conditions to form a subject 
fusion protein. In particular embodiments, the methods further comprise prior to the combining 

20 step, the step of making the polypeptide by expressing a nucleic acid encoding the polypeptide; 
and/or after the combining step, the step of separating the functional displayed domain from the 
functional phycobiliprotein domain. The methods steps may occur intracellularly, e.g. in a cell 
which is or is a progeny of a natural cell which naturally makes functional phycobiliprotein, or a 
cell engineered to produce functional phycobiliprotein. 

25 

DETAILED DESCRIPTION OF PARTICULAR EMBODIMENTS OF THE INVENTION 
The following descriptions of particular embodiments and examples are offered by way 
of illustration and not by way of limitation. 
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The invention provides methods and compositions for displaying a functional polypeptide 
(the foreign protein or displayed domain) on an oligomeric phycobiliprotein. The compositions 
include fusion proteins comprising a functional displayed domain and a functional 
phycobiliprotein domain incorporated in a functional oligomeric phycobiliprotein. 
5 A functional phycobiliprotein domain is capable of assembling the fusion protein in a 

functional oligomeric phycobiliprotein. Preferred domains provide at least 1%, preferably at 
least 10%, more preferably at least 50%, more preferably at least 75%, more preferably at least 
90% and most preferably substantially equivalent oligomer assembly ability as that of a 
corresponding unfused phycobiliprotein, as measured in competition assays, e.g. as described 
10 herein. 

Any phycobiliprotein domain having the requisite functionality may be used and these 
may be derived from natural, semisynthetic or synthetic sequences. A wide variety of natural 
phycobiliproteins are known in the art, e.g. Apt and Grossman, 1995, J Mol Biol 248, 79-96, 
including proteins derived from many cyanobacteria, rhodophytes (red algae) and cryptomonads, 

15 etc. (see, e.g. Glazer, 1994, J Appl Phycol 6, 105-1 12; Glazer et al. 1995, Photosynth Res 46, 93- 
105), particularly phycoerythrins, phycocyanins, and allophycocyanins. In addition, a wide 
variety of methods are known for modifying such natural sequences to generate semi-synthetics, 
e.g. Glazer, 1994, supra, describes phycobiliproteins having non-natural, predetermined bilin 
compositions, and exploiting intermolecular energy transfer by functionally coupling fluorescent 

20 phycobiliprotein tags with other fluorescent tags, such as cyanine dyes, and Toole et al., 1998, 
Mol Micro 30, 475-486 describes recombinations of phycobiliprotein deletion mutants. Finally, 
known phycobiliprotein structure-function relationships (e.g. Anderson et al., 1998, Mol Micro 
30, 467-474) are exploited to generate synthetic sequence analogs using conventional methods. 
In a particular embodiment, the phycobiliprotein domain is characterizable as an a or B 

25 subunit, based on its sequence similarity to natural a and B phycobiliprotein subunits. The 

selection of a or B phycobiliprotein domains may yield different results, affecting the displayed 
domain, the carrier protein, or both, and such differences guide the selection of the carrier 
phycobiliprotein subunit. For example, the phycocyanin a subunit and the B-Ll 1-a subunit- 
fusion are preferred when the foreign protein is displayed on the N-terminus of the carrier protein 
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because of the higher yield and the better spectroscopic properties of the resulting fusion protein. 
For display on the C-terminus, phycocyanin B subunit and the oc-Ll 1-1$ subunit fusion are better 
suited because the phycocyanin a subunit is sensitive to extension on its C-terminus (usually 
leading to incomplete bilin addition and in some instances partial unfolding of the protein). 
5 In a particular embodiment, the phycobiliprotein domain confers fluorescence on the 

fusion protein, preferably providing fluorescence quantum yield and molar extinction coefficients 
at least 1%, preferably at least 10%, more preferably at least 50%, more preferably at least 75%, 
more preferably at least 90% and most preferably substantially equivalent to that of a 
corresponding unfused phycobiliprotein, measured as described herein. Preferred domains 
10 provide extinction coefficients of at least 100K, preferably at least 300K, more preferably at least 
1M and/or quantum yields of at least 0.25, preferably at least 0.5, more preferably at least 0.6, as 
described herein. In other preferred aspects of this embodiment, the fluorescence emission 
spectrum of the fusion protein is substantially equivalent to that of a corresponding unfused 
phycobiliprotein. 

15 In a particular embodiment, the phycobiliprotein domain of the fusion protein comprises 

one or more bilins, preferably functional bilins contributing to the visible absorption spectrum of 
the phycobiliprotein domain and fusion protein, preferably natural bilins. In a particular 
embodiment, the bilins are covalently coupled to the phycobiliprotein domain, preferably through 
cysteine thioether linkages, preferably at natural bilin attachment sites. The bilins may be 

20 coupled to the phycobiliprotein domain by any mechanism that provides the requisite 

functionalities, including enzymatic addition. Accordingly, in a particular embodiment, the 
phycobiliprotein domain of the fusion protein provides a substrate for enzymatic bilin addition, 
which may provide natural or non-natural bound bilin distribution, preferably a substrate for 
enzymes which naturally modify a corresponding natural phycobiliprotein. 

25 The displayed domain may be any polypeptide having a requisite functionality and being 

compatible with that of the phycobiliprotein domain and with assembly of the fusion protein in a 
functional oligomeric phycobiliprotein. Accordingly, a wide range of displayed domains may be 
used including domains comprising an affinity tag, an oligomerization moiety, a specific binding 
moiety and/or a signaling moiety, and including polypeptides which may be coupled to 
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phycobiliproteins by chemical conjugation invention (see, inter alia, US Pat Nos. 4,520, 1 10; 
4,542,104; 4,857,474; 5,055,556; 5,707,804; 5,728,528; and 5,869,255). 

The displayed domain is preferably of length and sequence sufficient to provide a 
constrained structure, including at least secondary structure, preferably tertiary structure. In a 
5 particular embodiment, the constrained structure is of complexity sufficient to require complex 
folding and is poorly expressed independent of the fusion protein in active form in conventional 
expression systems, particularly conventional yeast (e.g. S. cerevisiae) and bacterial (e.g. E. coli) 
expression systems. Preferred displayed domains are refractive to expression when expressed 
independent of the fusion proteins, i.e. substantially not expressed in active form, in conventional 

10 expression systems, particularly E. colL The displayed domain may have any spectrophotometry 
properties compatible with the requisite functionalities of the fusion protein. In a particular 
embodiment, the displayed domain is substantially transparent to the wavelengths of visible light 
absorbed by phycobiliproteins, and therefore does not substantially affect the fusion protein 
and/or oligomeric phycobiliprotein light-harvesting function, and/or is substantially transparent 

15 to the wavelength(s) of energy emitted by the phycobiliprotein domain, and therefore is not 
substantially affected by such energy. 

The displayed domain may frequently be displayed on either terminus of the 
phycobiliprotein domain - important because many proteins preferentially tolerate extension on 
one of their termini. Preferred expression orientation is readily determined empirically, and in 

20 many cases (e.g. for making phycobiliprotein-labeled fluorescent reagents) is advised by 

published C- and N-terminal GFP fusions. In particular embodiments, display on the N-terminus 
of a phycobiliprotein B subunit, rather than the a subunit, is more conducive to folding of certain 
displayed proteins. Use of the subunit-fusion phycocyanins a-Ll 1-fl and B-Ll 1-a as the carrier 
protein may also have certain advantages: the fusion proteins tend to be more stable (see, e.g. 

25 Example B, below), and the 1:1 stoichiometry of a and B subunits is ensured. 

The fusion proteins may comprise additional components as desired, which may provide 
modules of functionalities, such as affinity handles, dimer- or oligomerization domains, 
stabilization domains, specificity domains, signaling domains, etc., apart from any such 
functionality/ies provided by the displayed domain. For example, for constructs to be used as 
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fluorescent labels, introduction of GCN oligomerization domains enhances both the 
spectroscopic value (more chromophores) and binding affinity (more sites for intermolecular 
interaction). 

In particular embodiments, the fusion protein comprises a specific binding moiety 
5 comprising at least one of a specific binding pair, such as a receptor - ligand pair, e.g. an 
immunoglobulin antigen-binding domain or antigenic domain, a lectin saccharide-binding 
domain or glycosylated or glycosylatable domain, an avidin or streptavidin biotin-binding 
domain or biotinylated or biotinylatable (i.e. providing a substrate for enzymatic biotinylation) 
domain, etc. In a particular embodiment, the protein comprises a biotinylated or biotinylatable 

10 domain, which is preferably biotinylated in the expression system (e.g. cell) selected for 

expression of the fusion protein. A wide variety of synthetic, semi-synthetic and natural such 
domains are known in the art, see e.g., Schatz et al. 1993, Bio/Technology 11, 1 138-1 143; 
Tatsumi et al., 1996, Anal Biochem 243, 176-180; Samols et al. 1988, J Biol Chem 263, 6461- 
6464, including homologs in phycobiliprotein producing cyanobacteria, e.g. Gornicki et al. 1993. 

15 J Bacteriol 175, 5268-5272; Phung et al., GenBank Accession No. U59235; Nakamura et al. 

1998 Nucl Acids Res 26, 63-67. In fact, enzymes sufficient to biotinylate biotinylatable domains 
have been characterized (e.g. Beckett et al. 1999, Protein Sci 8, 921-929; Buoncristiani et al. 
1988, J Biol Chem 263, 1013-1016), permitting in vitro biotinylation (e.g. Li et al., 1992, J Biol 
Chem 267, 855-863). These biotinylated domains permit especially convenient affinity 

20 purification tags (e.g. Cronan 1990, J Biol Chem 265, 10327-10333, Example C, below) and are 
useful in the many well developed biotin/avidin applications (e.g. Wilchek and Bayer (ed) 1990, 
Methods Enzymol 184, Academic Press, NY). 

In another example, various spacers or flexible linker peptides providing a variety of 
functionalities, such as a specific endopeptidase recognition and/or cleavage site, an affinity- 

25 purification tag, etc., may be used between the displayed and phycobiliprotein domains. For 
example, when displayed C-terminally to the phycobiliprotein domain, a specific protease 
recognition and cleavage site can be engineered immediately upstream from the displayed 
domain so, upon cleavage with the protease, the displayed domain can be cleanly released from 
the carrier protein. This strategy also works for most proteins displayed on the N-terminus of the 
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carrier protein because the functions of most displayed proteins are not affected by C-terminal 
extensions several residues long. In situations where such C-terminal extension is highly 
undesirable, an intein domain (Perler FB, Jan 1, 2000, Nucleic Acids Res 28, 344-345 "InBase, 
the Intein Database") can be engineered immediately downstream from the displayed domain. 
5 Subsequent excision of intein cleanly releases the displayed domain from the carrier protein. 

The linkers may also be used to facilitate display of domains that would otherwise 
interfere with oligomeric phycobilisome assembly. The length and amino acid sequence 
requirements of such functionality are readily determined empirically for a given fusion 
construct. Generally, the linkers are preferably from at least 5, preferably at least 10 residues in 

10 length, typically requiring no more than 50, and more often no more than 30 residues. To 

facilitate an unintrusive orientation (see, e.g. Example A, below) small, flexible residues such as 
Ala, Gly and Ser are particularly convenient components. 

The fusion proteins are incorporated in a functional oligomeric phycobiliprotein, see e.g. 
Glazer, 1994, Adv Mol Cell Biol 10, 119-149, comprising at least a dimer, preferably an a, B 

15 heterodimer. The oligomers, especially when assembled into higher-order structures such as 
trimers, hexamers, rods and phycobilisomes, constrain one displayed protein from interacting 
with another. This is particularly useful in producing proteins whose function is (1) harmful to 
the cell but (2) dependent on the formation of dimer or multimer. In a particular embodiment, the 
oligomeric fusion phycobiliproteins are structurally substantially identical to those of 

20 corresponding natural phycobiliproteins and preferably provide fluorescence quantum yield and 
molar extinction coefficients at least 1%, preferably at least 10%, more preferably at least 50%, 
more preferably at least 75%, more preferably at least 90% and most preferably substantially 
equivalent to those of corresponding natural phycobiliproteins. In other preferred aspects of this 
embodiment, the fluorescence emission spectrum of the oligomeric fusion phycobiliproteins is 

25 substantially equivalent to those of corresponding natural phycobiliproteins. 

Because higher order oligomers (phycobilisomes) can form a complex with the water- 
splitting oxygen-evolving photosystem n, the surrounding environment of phycobilisomes, 
unlike virtually all other cellular environments, may be oxidative enough for spontaneous 
formation of disulfide bonds. Therefore phycobilisome display may be applied to the expression 
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and folding of disulfide bond-containing proteins. Accordingly, the compositions also include 
functional oligomeric phycobiliproteins comprising the subject fusion proteins and cells 
comprising such oligomeric phycobiliproteins. 

The fusion proteins are expressible in any convenient system compatible with expression 
5 of the fusion protein, preferably an empirically optimized host cell, host cell of the most closely 
related natural phycobiliprotein, or lysate or extract thereof. In a particular embodiment, the 
fusion protein is expressed at least 1%, preferably at least 10%, more preferably at least 50% ? 
more preferably at least 75%, more preferably at least 90% and most preferably substantially 
equivalent to that of a corresponding unfused phycobiliprotein domain. A wide variety of 

10 expression systems may be used. For example, Anabaena cells expressing phycobilisome 
displayed proteins have normal phenotype and growth rates and can be grown large scale in 
defined inorganic salt media. Alternatively, cells or protoplasts may be engineered or 
reconstituted to express requisite lyase subunits (e.g. cpcE and cpcF, e.g. Fairchild et al., 1 994 ? J 
Biol Chem 267, 16146-16154) and/or phycobiliprotein linker proteins (e.g. cpcC, e.g. Swanson 

15 et al., 1992, J Bact 174, 2640-2647) under conditions wherein the requisite oligomeric 

phycobilisome proteins are formed. Reconstituted heterologous cellular systems require either 
high levels of phycobiliprotein domain expression (sufficient to form dimers; see, Glazer et al., 
1973, J, Biol Chem 248, 5679-5685) or the expression of a compatible heterophycobiliprotein 
domain (to form heterodimers), i.e. simply attempting to express a labeled phycobiliprotein 

20 fusion protein with a coexpressed lyase in a hererologous cell is insufficient to form the required 
oligomeric phycobiliproteins (Schroeder 1997, Phycobiliproteins: Biosynthesis and 
Applications,UC Berkeley, Dissertation). Alternatively, extracellular or cell free systems, such as 
lysates, extracts and reconstituted in vitro bilin addition systems may be used (e.g. Arciero et al. 
1988, J Biol Chem 263; 18343-18349; 18350-18357; 18358-18363). Because phycobiliproteins 

25 and phycobilisomes are made at very high levels in the cell, especially under low light intensity, 
phycobiliprotein display also helps to increase the yield of the displayed protein. 

The subject methods include methods for making a functional displayed domain, the 
method comprising the step of combining a polypeptide comprising a displayed domain and a 
phycobiliprotein domain with a phycobiliprotein subunit under conditions to form a subject 
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fusion protein. In particular embodiments, the methods further comprise prior to the combining 
step, the step of making the polypeptide by expressing a nucleic acid encoding the polypeptide; 
and/or after the combining step, the step of separating the functional displayed domain from the 
functional phycobiliprotein domain. The methods steps may occur intracellularly, e.g. in a cell 
5 which is or is a progeny of a natural cell which naturally makes functional phycobiliprotein. 

Our extensive studies on phycobiliprotein subunit fusion constructs demonstrate the 
versatility of phycobilisome display. In fact, our display concept may be practiced with other 
macromolecules, such as ribosomes, and such macromolecular complex display can also help in 
posttranslational modification of proteins. For example, proteins displayed on 80S ribosomes in 
10 eukaryotic cytosol can undergo different posttranslational modifications than proteins displayed 
on 70S ribosomes in the mitochondria and chloroplasts. Similarly, proteins displayed on different 
sides of the ER are also subjected to different modifications. 

EXAMPLES AND DETAILED EXPERIMENTAL PROTOCOLS 
15 Example A. Expression and characterization of recombinant phycobiliprotein fusion proteins. 
We describe here the expression of genes engineered to encode Anabaena sp. PCC7 1 20 C- 
phycocyanin a and P polypeptides bearing a 24-residue N-terminal peptide tag, incorporating a 
block of six His residues (generally referred to as the "6xHis tag"), in wild-type and mutant cells 
of the filamentous cyanobacterium Anabaena sp. PCC7120. The 6xHis tag allows one-step 
20 purification of the recombinant polypeptides by immobilized metal-ion affinity chromatography 
[IMAC; (8)] away from the wild-type phycobiliproteins. We show here that phycocyanobilin 
(PCB) is correctly attached to the His-tagged phycocyanin a and P polypeptides and that they 
fold to yield molecules whose spectroscopic properties correspond to those of wild-type a and p 
polypeptides. 

25 Cultures and Strains - Escherichia coli strain DH5a (11) grown in LB medium was used 

in all cloning and expression experiments. Plasmid-encoded resistance to ampicillin or 
spectinomycin (Sp) was selected at an antibiotic concentration of 100 |ig ml" 1 . For 
overexpression of His-tagged proteins, isopropyl-p-D-thiogalactoside (IPTG) was added to a 
final concentration of 0.5 mM to exponentially growing cultures to derepress the trc promoter. 
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Induced cultures were usually grown with vigorous aeration for 5 hr at 30°C to reduce formation 
of inclusion bodies. Induced cells were pelleted and stored at -20°C until use. 

Anabaena sp. strain PCC7120 and its derivative strains were grown at 30°C on modified 
AA minimal medium plus nitrate (12) under medium (75 |iE nr 2 s _1 ) to high (200 |iE nr 2 s' 1 ) 
5 light intensity provided by cool-white fluorescent bulbs. Anabaena cultures overexpressing His- 
tagged phycocyanin subunits were grown in half-strength AA plus nitrate medium (bubbled with 
5% CO2 in air, with 2.5 mM HEPES pH 9.0 buffer to maintain pH) to late-exponential phase 
before addition of IPTG to 0.5 mM. The cultures were induced for 2 to 3 days before cells were 
harvested and stored at -20°C. Triparental mating to introduce free-replicating plasmids into 

10 Anabaena cells from E. coli was performed essentially as described (13). Strains bearing the 

plasmid-bome aadA gene, encoding resistance to streptomycin (Sm) and to Sp, were selected at ! 
[Xg ml -1 Sm plus 10 \xg ml -1 Sp on agar-solidified medium and at 20 |ig ml -1 Sp alone in liquid 
medium. Mutant strains B646, B64328, and B64407 generated by transposon insertion (7) were 
selected with neomycin sulfate (Nm) at 200 [ig ml" 1 on solid medium and 10 |j,g ml -1 in liquid 

15 medium. 

Cloning of Anabaena sp. PCC7120 C-Phycocyanin a and p subunits - Standard 
procedures were used for most molecular biological manipulations (14). Genomic DNA was 
isolated from Anabaena sp. PCC 7120 as described (15). The cpcA and cpcB genes encoding the 
a and p subunits of phycocyanin, respectively, were amplified for genomic DNA by the 

20 polymerase chain reaction (PCR; 16). The 0.5-kb PCR fragment was digested with the restriction 
enzymes Ndel and Hindm, and cloned into Ndel- and /frm/DI-digested cloning vector pUC 19 
[(17); GenBank accession No. X02514], giving plasmid pBS185. The 0.5-kb PCR fragment was 
digested with enzymes Ndel and EcoRl, and cloned into Ndel- and £c6>RI-digested pUC19, 
giving plasmid pBS251. All fragments generated by PCR were sequenced to verify fidelity using 

25 the PRISM 373 DNA sequencing system and the dye-terminator cycle-sequencing kit from 

Applied Biosystems (Foster City, CA). DNA and amino-acid sequence analyses were performed 
with the programs Editbase (Purdue Research Foundation and USDA/ARS) and Lasergene 
(DNASTAR Inc., Madison, WI). 
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In verifying the sequences of PCR products, two discrepancies were discovered near the 
end of the published cpcB DNA sequence from the same organism (5). Base number 487 
(numbering of the published sequence) is an A instead of a T. This changes the Cys codon (TGC) 
to a Ser codon (AGC). Another difference is within the tandem CTG repeats shortly before the 
5 stop codon. There are four such CTG triplets, rather than only three shown in the published 

sequence. In addition to sequencing PCR-generated clones, the base changes were also confirmed 
by sequencing of a restriction fragment containing the entire cpcB gene. The last 36 base pairs of 
the cpcB gene sequence determined here, encompassing both corrections, encode a 12-amino 
acid sequence identical to that of another filamentous cyanobacterium, Mastigocladus laminosus 

10 [(18); GenBank accession No. M75599], and the sequence of the entire CpcB polypeptide aligns 
much better with those of other C-phycocyanin (3 subunits (19). The mass of the Anabaena sp. 
PCC7120 |3 subunit is in excellent agreement with that predicted from the corrected cpcB 
sequence. A 16.3-kb DNA sequence with the corrected cpcB sequence, the complete sequence of 
cpcF (see below), and other new information has been deposited into GenBank under accession 

15 No. AF178757. 

Construction of expression vectors - A family of plasmids was specifically constructed 
for inducible overexpression of His-tagged polypeptides in both E. coli and Anabaena sp. 
PCC71120. Plasmid pBS150v contains the following components: (A) From 0 to 1.3 kb, a portion 
from pBR322 [(20); GenBank accession No. V01 1 19] that contains the ColEl oriV for plasmid 

20 replication in E. coli, the oriT {bom) site for conjugal transfer, and the rop (repressor of primer) 
gene. Although the first 1 1 codons of the Rop open reading frame were changed in pBS150v, the 
modified Rop protein apparently can still form the homodimer four-alpha-helix structure (2 1), 
helping to maintain the plasmid at a medium copy number like that of pBR322 (20); (B) From 
1.3 to 2.3 kb, most of the C.S4 cassette (22) containing the engineered aadA gene that confers 

25 resistance to Sm and Sp; (C) From 2.3 to 4 kb, a portion from the expression vector pPROEX- 1 
(Life Technologies, Inc.) that contains the lacN gene coding for the lac repressor, and multiple 
cloning sites for construction of sequences coding for His-tagged proteins, expression of which is 
controlled by the trc promoter. The N-terminal 24 residues contain the 6xHis affinity tag as well 
as a 7-amino acid recognition and cleavage site for the tobacco etch virus (TEV) protease for 
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removal of the 6xHis tag if so desired (23, 24); and (D) From 4 to 4.6 kb, a portion from the 
expression vector pMAL-c2 (New England Biolabs, Inc.) that contains the lacZ a gene for 
blue/white screening of insert recombinants, and downstream, two strong, bidirectional, p- 
independent transcriptional terminator structures from the E. coli rrnB gene (25). The nucleotide 
5 sequence of pBS150v has been deposited in GenBank under the accession No. AF177932. 

A portion of the plasmid pDUl (26) coding for a replication protein and a resolvase was 
inserted in the unique Eco47Ul site of pBS150v, giving plasmid pBS150. This 3.7-kb fragment. 
pDUlHC, contains spontaneous mutations that enable autonomous replication of cognate 
plasmids in higher copy number in Anabaena sp. (27). Plasmid pBS152v (sequence also 

10 available from GenBank under accession No. AF177933) is similar to pBS 150v but sacrifices the 
lacZP* gene for more cloning sites like those in pPROEX-1. The pDUlHC* version, pBS 152, 
was constructed like pBS150. His-tagged constructs were usually made using the small, "v" 
versions of the plasmids and then converted to the larger, pDUlHC-containing form. When 
induced, the trc promoter initiates very strong transcription in both E. coli and Anabaena. While 

15 very tightly controlled in E. coli, the trc promoter is poorly repressed in Anabaena sp. partially 

due to the atypical promoter and the presence of some codons unfavorable to Anabaena sp. in the 
lacN gene, leading to a relatively high level of constitutive expression of the 6xHis-tagged 
proteins. Preliminary Western analyses showed only about three-fold increase in production of 
His-tagged proteins in Anabaena cultures upon induction with IPTG. 

20 Isolation of phycobilisomes - Phycobilisome (PBS) preparations were characterized on 

discontinuous sucrose density gradients as described (28), with minor modifications. Frozen or 
freshly harvested Anabaena cells were used with no obvious difference in results. Na/K-P04 
buffer (0.75M; 1 : 1 ratio of Na-P04 and K-P0 4 , pH 7.5) containing 1 mM each of NaN 3 and 
ethylenediamine tetraacetate (EDTA), was used. Individual PBS, rod, and hexamer fractions 

25 collected from the sucrose gradients were dialyzed extensively against buffer 0 (20 mM Tris-HCl 
pH 8.0, 50 mM NaCl, 50 mM KC1) to eliminate sucrose, phosphate, azide, and EDTA before 
isolation of His-tagged phycobiliproteins on a Ni 2+ -NTA column (see below). Similarly, for 
isolation of intact His-tagged PBS on Ni 2+ -NTA, the PBS fraction was dialyzed against 0.75 M 
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phosphate buffer, pH 7.5 (without EDTA and azide) prior to loading on a Ni 2+ -NTA column (see 
below). 

To isolate PBS containing His-tagged phycocyanin subunits, the PBS preparation in 0.75 
M phosphate was loaded on 1 ml of Ni 2+ -NTA resin pre-equilibrated with 0.75 M phosphate 
5 buffer. The high concentration of phosphate, along with possibly lesser steric availability of the 
6xHis tags of phycocyanin subunits incorporated within PBS, significantly decreased the rate of 
binding of PBS to the Ni 2+ -NTA resin. Several passages of the same sample through the column 
were needed to get adequate binding. The resin was then washed with 20 column volumes of 
0.75 M, phosphate buffer plus 30 mM imidazole. PBS bound on resin were eluted with 0.75 M 

10 phosphate buffer plus 200 mM imidazole. The eluate was immediately dialyzed against 0.75 M 
phosphate buffer to remove imidazole. 

Isolation of His-tagged Proteins by Immobilized Metal Affinity Chromatography - Cell 
pellets were thawed and resuspended in 10 to 20 volumes of cold (0-4°C) buffer 0. 
Phenylmethylsulfonyl fluoride (PMSF) and (3-mercaptoethanol were added to give final 

15 concentrations of 1 mM and 10 mM, respectively, immediately before breakage of cells by 

passage through a French press cell, three times at 18,000 p.s.i. Cellular debris was then removed 
by centrifugation at 4°C in an angled rotor at 30,000 xg for 20 min (preparations from E. coli) or 
at 130,000 xg for 1 hr (preparations from Anabaena sp.). The supernatant was loaded on a 
column of 1 to 3 ml of Ni 2+ -NTA resin pre-equilibrated with buffer 0. The resin was then 

20 washed consecutively with ten column volumes each of cold buffer Al (20 mM Tris-HCl pH 8.0, 
50 mM NaCl, 50 mM KC1, 20 mM imidazole, 5% v/v glycerol), buffer B (20 mM Tris-HCl pH 
8.0, 500 mM NaCl, 500 mM KC1), and buffer A2 (20 mM Tris-HCl pH 8.0, 50 mM NaCl, 50 
mM KC1, 30 mM imidazole). His-tagged proteins were eluted from the resin with buffer C (20 
mM Tris-HCl pH 8.0, 50 mM NaCl, 50 mM KC1, 200 mM imidazole). The eluate was 

25 immediately dialyzed against 50 mM Tris-HCl pH 8.0, 50 mM NaCl, 50 mM KC1, 1 mM DTT, 
to remove imidazole which at high concentrations is slightly denaturing to proteins. Recombinant 
His-tagged phycocyanin isolated in this manner consisted mostly of ap heterodimers. A crude 
estimate of the yield of His-tagged phycocyanin was calculated from the amount of 
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phycobiliprotein (expressed in units of A620nm ml) eluted with buffer C vs. the total amount of 
phycobiliprotein applied to the Ni 2+ -NTA resin. 

Size Exclusion Chromatography-High Performance Liquid Chromatography - Although 
IMAC-purified and dialyzed samples (see above) could be used with no obvious effect on the 
5 results,' proteins to be analyzed by SEC-HPLC were generally dialyzed again in 50 mM Na-P04 
pH 7.0, 50 mM NaCl, 0.5 mM DTT. SEC-HPLC separations were performed with a Waters 600 
pump and line detection achieved with a Waters 991 photodiode array detector (Waters 
Associates, Milford, MA). A mobile phase of 50 mM Na-PC>4 pH 7.0 was flowed at 0.8 ml min - 1 
through Bio-Sil 250 (Bio-Rad Laboratories, Hercules, CA) guard column (80 x 7.8 mm) and 

10 column (300 x 7.8 mm) maintained at 25°C. For analytical separations, 50 to 200 jag of 
phycocyanin was loaded [based on absorbance at the peak near 621 nm and using a molar 
extinction coefficient of 290,000 M" 1 cm -1 per monomer (29)]. Molecular size calibration 
utilized bovine gamma globulin, 158 kDa; chicken ovalbumin, 44 kDa; horse myoglobin, 17 
kDa; vitamin B 12, 1.35 kDa (Bio-Rad gel filtration standards). The void volume was determined 

15 with blue dextran, 2000 kDa (Pharmacia Biotech, Piscataway, NJ). Preparative separations were 
performed with loads of 0.5 to 1 mg phycocyanin. 

Analytical Ultracentrifugation - Measurements were performed on a Beckman XLA 

« 

centrifuge using an An-60Ti rotor maintained at 20°C and 20,000 rpm. Affinity-purified samples 
were run at a concentration of 0.5 mg ml" 1 in dialysis buffer. Data were fit to a two component 

20 regression (which was superior to a single-component fit) and the molecular weights determined 
with the HID4000 software. 

SDS-Polyacrylamide Gel Electrophoresis - Proteins were precipitated with 10% (w/v) 
trichloroacetic acid, washed once with ice-cold water, and resolubilized in SDS loading buffer 
[2% sodium dodecyl sulfate, 50 mM Tris-HCl, pH 6.8, 100 mM DTT, 0.1% Bromphenol Blue, 

25 and 10% v/v glycerol; (13)]. SDS-PAGE (30) was performed using 10% acrylamide stacking and 
14% separating gels, with monomer/bis ratio of 37.5:1. Bilin-bearing polypeptides were 
visualized as fluorescent bands by UV (> 312 nm) illumination after staining with 10 mM zinc 
acetate (31). Polypeptides were also visualized by staining with Coomassie Brilliant Blue. 
Protein molecular weight standards were purchased from Bio-Rad Laboratories. 
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Absorbance and Fluorescence Spectrometry - Absorbance spectra were acquired on a 
computer-controlled, dual-beam X6 UV/Vis spectrophotometer (Perkin-Elmer Corp., Norwalk, 
CT). Absorbance of Anabaena cell suspensions was measured from 400 to 750 nm with a light 
beam passing through the frosted side of a 1-cm light path square cuvette. Fluorescence 
measurements were performed, on samples with a maximal absorbance of less than 0.07 to 
minimize the inner-filter effect, with a Perkin-Elmer MPF-44B fluorimeter coupled with an I/O 
board to a Macintosh computer for digitization and storage of data. Excitation and emission slits 
were set at 5 or 6 nm for all measurements. Excitation spectra were measured with emission 
observed at a wavelength 10 nm to the red of the sample's fluorescence emission maximum. 

Fluorescence quantum yield measurements, for samples isolated from SEC-HPLC, were 
made relative to cresyl violet (in ethanol, Of = 0.59; Eastman Kodak Co., Rochester, NY) using 
the K2 multifrequency phase and modulation fluorimeter (ISS, Champaign, IL) with excitation 
and emission slits set at 8 nm. All samples were diluted to peak absorbance between 0.01 and 
0.05, excited at 570 nm and emission acquired from 575 to 800 nm. These emission spectra were 
then instrument-corrected, converted to wavenumber scale, and bandpass corrected by 
multiplying emission intensity at each wavelength by the square of the respective wavelength 
(32). The resulting spectra were integrated and quantum yields calculated according to previously 
described equations (33). 

Molar extinction coefficients were determined according to previously established 
methods (34). In brief, absorbance spectra of samples isolated as trimers (unless otherwise noted) 
from SEC-HPLC were measured. These samples, having peak absorbance between 0.9 and 1.2, 
were diluted lOx with 8 M urea, pH 1.9 containing 10 mM DTT. The absorbance was measured 
at 660 nm for these denatured samples, and molar extinction coefficients were calculated for the 
original native samples from the known value of 35,400 M 1 cm 1 (at 660 nm) for each peptide- 
bound PCB in acid urea (35). Calculations assumed a stoichiometry of one PCB per a subunit 
and two PCBs per (3 subunit. Molar extinction coefficient of each peptide-bound PCB at 280 nm 
was measured using trimeric phycocyanins purified from SEC-HPLC. Samples in 50 mM Na- 
PO4 pH 7.0, with peak absorbance between 1.0 and 1.5 at 620 nm, were denatured with 9 M urea 
to give final 8 M urea, pH 2.0. The extinction coefficient of a protein-bound PCB at 660 nm was 
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identical to the value mentioned above, while that at 280 nm was calculated by subtracting 
contributions from Tyr [£280 nm = 1,370 M 1 cm" 1 ; (36)] andTrp [£280 nm = 5,500 M' 1 cnr 1 ; 
(37)] residues. 

Mass Spectrometry - For electrospray mass spectrometry, purified proteins (0.5 to 1 mg) 
5 were dialyzed extensively against 10 mM ammonium acetate prior to lyophilization. Proteins 
redissolved in 50% acetonitrile in 0.2% aqueous formic acid were analyzed with a VG Bio-Q 
mass spectrometer as described (38). Mass spectral analysis of different phycobiliprotein 
polypeptides, as well as of some other proteins expressed in Anabaena sp., showed quantitative 
posttranslational removal of the N-terminal Met in Anabaena sp. PCC7120. For this reason, 

10 amino acid residue numbering starts from the residue after the initial Met, and follows that used 
for numbering of bilin-linked residues in crystailographic studies of the highly homologous 
phycocyanin from Mastigocladus laminosus (39). However, our examination of 
phycobiliproteins from other cyanobacteria, as well as amino acid sequence data published by 
others (40, 41), showed that such posttranslational processing is not universal to cyanobacteria. 

15 Molecular modeling ofHis-tagged C-phycocyanin - The crystal structure of C- 

phycocyanin from the cyanobacterium Fremyella diplosiphon has been solved at 1.75 A 
resolution (42). The structural data for that phycocyanin, which is highly homologous to that of 
Anabaena sp. PCC7120, were used to build a molecular model of the Anabaena C-phycocyanin 
with the 24-amino acid N-terminal extension on the a subunit. Examination of the C- 

20 phycocyanin hexamer structure revealed a groove lying on the P face between P subunits. In 

order to present a reasonable graphical representation of the HTa:p model, we chose to lead the 
24-amino acid extension from inside to outside of the ring structure of phycocynin trimer through 
this surface groove. Although structural data supporting this choice is lacking, our results suggest 
that this is a plausible placement. Modeling of the 24-residue extension onto the C-phycocyanin 

25 hexamer was performed using the published monomer coordinates [(42); PDB accession code: 
lcpc]. The 24-residue tag was built onto the N-terminus of the a subunit as follows. First, the 
coordinates for a well-defined 25-residue peptide (the last one at C-terminus being Met) were 
excised from the published coordinates of another protein (antibody 48G7; PDB accession code: 
1GAF). Next, all side chains except for that of the last Met were truncated to Ala (or left as Gly). 
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Using the least-squares superposition of the program LSQMAN (43), the C-terminal Met of the 
25-mer peptide was overlaid with the N-terminal Met of the a subunit . The tag was then 
coarsely laid into the above-mentioned groove using the torsion commands of the program O 
(44), and the side chains mutated again to conform with sequence of the 24-residue tag of HTa. 
5 Again, using only the torsion commands, phi, psi, and chi angles on the 24-residue tag were 

manipulated so as to conform to the following constraints: (a) all non-bonded contacts involving 
tag-to-tag and tag-to-phycocyanin atoms must be in excess of 2.5 A; (b) large part of the tag must 
lay more or less in the (3-face groove; and (c) the six His residues must place externally to the 
hexamer so as to be solvent-accessible. The model was then subjected to 200 rounds of Powell 

10 energy minimization using the program XPLOR (45, 46), during which all of the original C- 
phycocyanin atoms were constrained from movement. This resulted in the torsion angles and 
non-bonded contacts of the 24-residue tag approaching canonical values. Quality of the model 
with respect to phi/psi angles was verified using the program PROCHECK (47). The 
Ramachandran plot showed that the modeled tag contained no disallowed phi or psi angles. To 

15 build trimers, symmetry operators were applied to the HTa: (3 monomer model using program O. 
The hexamer model was built by adding the tag coordinates to the second a subunit coordinates 

e 

of the lcpc structure and then applying the symmetry operators. 

Nomenclature - A His-tagged protein preparation purified by IMAC may contain more 
than one protein species. For example, both the apoprotein and holoprotein version of a His- 

20 tagged phycocyanin subunit may be present. Consequently, we refer to such a preparation by a 
nomenclature which specifies the organism in which it is being expressed (i.e., Anabaena sp. or 
E. coli), and the number of the pBS plasmid which contains the gene encoding the recombinant 
subunit. Thus, when His-tagged phycocyanin a subunit is expressed in Anabaena sp. PCC7120 
bearing the plasmid pBS168, the fraction purified by IMAC is designated Anl68. A fully 

25 characterized fraction, shown to consist of a single species, for example, pure His-tagged 

phycocyanin a subunit, is designated HTa. A mutant polypeptide, for example, a His-tagged 
Anabaena sp. PCC7120 phycocyanin a subunit with a Thr residue in place of an Ala residue at 
position 12, is designated HTa A12T . The absence or presence of the bilin chromophores is 

4 
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indicated by the prefixes "apo" and "holo", respectively. A His-tagged a subunit with native P 
subunit non-covalently associated is designated HTa:p. 

Table 1 summarizes the properties of each of iht Anabaena sp. PCC7120 wild-type and 
derivative strains and of expression vectors carried by these strains used for the production of 
5 His-tagged C-phycocyanin normal and mutant a and p subunits. 

Expression of His-Tagged C-Phycocyanin a Subunit in Wild-Type Anabaena sp. - The 
0.5-kb Ndel-cpcA-HindlR fragment from pBS185, encoding the wild-type phycocyanin a 
subunit, was cloned into pBS150, giving plasmid pBS168 encoding the a subunit with a 24- 
residue N-terminal extension. Cultures of Anabaena sp. PCC7120(pBS168) expressing 6xHis- 

10 tagged phycocyanin a subunit were very similar to wild-type cultures grown under identical 

conditions with respect to the relative amounts of phycobiliproteins and chlorophyll, and showed 
no apparent negative phenotypic characteristics. 

Isolation and characterization ofHTa - When cell lysate supernatant from Anabaena sp. 
PCC7120(pBS168) was passed through a Ni 2+ -NTA affinity column, some 30 to 40% of the cell 

15 phycobiliprotein (as estimated from A max at 620 nm) was retained on the column and then eluted 
with imidazole, a His-tag competitor. SDS-PAGE analysis of the His-tagged protein fraction 
showed two bands of 21.9 kDa and 19.5 kDa, corresponding to the calculated molecular weights 
of His-tagged C-phycocyanin a and native p subunits, respectively. Upon Zn 2+ -staining, both 
bands showed red fluorescence with near-UV excitation, indicating that each carried covalently 

20 linked phycocyanobilin. Identification of the two polypeptides was confirmed by mass 

spectrometry, which showed two components of 20,79 1.4±2.3 (holo-HTa minus the N-terminal 
Met) and 19,441±L3 (holo-P minus the N-terminal Met) in a 1:1 ratio. These mass values 
confirm the presence of one PCB on HTa and two PCBs on the P subunit. No other significant 
peaks, especially that corresponding to an apo-HTa, were present in the mass spectrum. These 

25 data indicate that the protein purified by MAC, Anl68, is phycocyanin holo-HTa:holo-p. 

The assembly forms of phycocyanin holo-HToc:holo-P and their apparent molecular 
weights were determined by analytical SEC-HPLC. As shown in Table 2, three components were 
present under our particular chromatographic conditions, a monomer (calculated mass 40.5 kDa) 
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at 49.2 kDa, a trimer (calculated mass 121.5 kDa) at 145.3 kDa, and a hexamer (calculated mass 
243 kDa). Over 90% of the protein was trimeric, (HToc:(3)3, the major assembly state observed 
under these conditions for native PC isolated from wild-type Anabaena sp. PCC7120. The SEC- 
HPLC data were corroborated by analytical ultracentrifugation, where most of the HToe:(3 was 
5 found to have a mass of 125.9 kDa, and the balance a mass of 34.8 kDa (Table 2). All three 

fractions from SEC-HPLC were shown by SDS-PAGE to contain holo-HToc and holo-|3 in a ratio 
of 1:1. Thus, the Anl68 His-tagged protein preparation purified by MAC consists of HToc:(3, and 
displays aggregation behavior typical of native PC under similar in vitro conditions. 

(HTa:p)3 isolated by SEC-HPLC had a A, max of 621 nm at protein concentrations >0.5 

10 mg ml -1 , and gave spectra with blue-shifted maxima (to as low as 615 nm) upon dilution (Table 
3). This is consistent with the blue shift in X max of wild-type phycocyanin when higher order 
assemblies dissociate to monomers as the protein concentration is lowered (48). Interestingly, at 
high protein concentrations, HTot:J3 displays a h max slightly red-shifted relative to that of native 
PC measured at similar concentrations. The £m of (HTa:P)3 was determined to be 9.21xl0 5 M 1 

15 cm -1 at 621 nm, similar to values observed for native phycocyanin from Anabaena sp. and other 
cyanobacteria [Table 3; (49)]. The excitation spectrum for 650 nm emission coincided nearly 
perfectly with the absorbance spectrum, indicating that the HTa:P preparation behaved as a 
single species with respect to spectroscopic properties. The kwas at 642 nm (with 560 nm 
excitation) with a O f of 0.22, similar to a value of 0.27 measured for native Anabaena PC (Table 

20 3). In summary, the spectroscopic characteristics of HTot:(3 are virtually identical to those of 
native Anabaena sp. PCC7120 C-phycocyanin. 

Incorporation ofHTa into Phycobilisomes - Since holo-HTa:holo-(3 represents between 
30 and 40% of the total phycobiliprotein in Anabaena sp. PCC7120(pBS168), it appeared likely 
that holo-HTot is effectively incorporated into phycobilisomes. This possibility was examined by 

25 analyzing phycobilisomes and their partial dissociation products for the presence of holo-HTa. 

Compared to the wild-type Anabaena sp. PCC7120 phycobilisome preparation, in the Anabaena 
sp. PCC7120(pBS168) preparation a lower percentage of the total phycobiliproteins was found in 
the phycobilisome fraction, while more was found in the rod substructure and the hexamer 
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fractions. This indicates that the phycobilisomes of the strain expressing holo-HTa are less stable 
under the conditions used for phycobilisome preparation. SDS-PAGE analysis showed that holo- 
HTa was present in all three fractions. 

* Anabaena sp. PCC7120(pBS168) phycobilisomes loaded on a Ni 2+ -NTA column (in the 
5 0.75M Na/K-P04 pH 7.5 buffer needed to maintain phycobilisome integrity, see "Materials and 
Methods"), bound to the column, although the binding capacity of the column under these 
conditions appeared low. The His-tagged phycobilisomes were eluted at high imidazole 
concentration. After removal of imidazole by dialysis, the SDS-PAGE polypeptide profile and 
spectroscopic properties of this phycobilisome fraction were very similar to that of the starting 
10 phycobilisome preparation. These results show that the 6xHis tags of the HTa subunits 

incorporated into phycobilisomes are relatively exposed, available for interaction with the Ni 2+ - 
NTA matrix. 

Expression of His-Tagged C-Phycocyanin (3 Subunit in Wild-Type Anabaena sp. - The 
0.5-kb Ndel-cpcB-EcoRl fragment from pBS251, encoding the wild-type phycocyanin p subunit, 

15 was cloned into pBS 150, giving plasmid pBS262 encoding the (3 subunit with the 24-residue N- 
terminal extension. Cultures of Anabaena sp. PCC7120(pBS262) expressing HT(3 were 
phenotypically very similar to those of Anabaena sp. PCC7120(pBS168) expressing HTa, except 
for a slightly yellowish appearance. Their whole-cell absorbance spectra showed a small decrease 
in the amount of phycobiliprotein per cell relative to chlorophyll. 

20 The yield of the His-tagged phycobiliprotein fraction from Anabaena sp. 

PCC7120(pBS262) was 16-18% of the total phycobiliprotein in the cell lysate, substantially 
lower than the 30-40% in the corresponding fraction from Anabaena sp. PCC7120(pBS 168) 
expressing HTa (see above). Analysis of the His-tagged phycobiliprotein preparation, An262, by 
SDS-PAGE showed the presence of holo-HT(3 and holo-oc in equal amounts. Mass spectral 

25 analysis showed that the two subunits carried the normal amounts of PCB, i.e., one PCB per a 
and two per HT[3 subunit, and no apo-polypeptides were present. 

In analytical SEC-HPLC, the ot:HTp holoprotein preparation isolated by IMAC was 
primarily trimeric (calculated mass 121.5 kDa) at 136.9 kDa. The shoulder/tail of that peak, 
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constituting less than 5% of the total His-tagged protein loaded, could be attributed to monomers 
(calculated mass 40.5 kDa) at 62.4 kDa (Table 2). Both fractions were shown by SDS-PAGE to 
contain holo-a and holo-HTp in a molar ratio of 1:1. In contrast to native PC and the HTa:(3 
holoprotein, the hexamer component was not observed for the oc:HTp holoprotein. Analytical 
ultracentrifugation data were consistent with two components observed in SEC-HPLC: a 
majority of the a:HTp holoprotein had a mass of 121.9 kDa, and a small fraction of 43.4 kDa, 
corresponding closely to the theoretical masses of trimers and monomers, respectively. 

The ^ max of the trimeric a: HTp holoprotein component lay between 615 and 619 nm 
depending on protein concentration (Table 3). As observed with HTa:p, X max shifted blue upon 
dilution. However, at high protein concentration the X max of the oc:HTP holoprotein, like that of 
native PC, did not exceed 619 nm, in contrast to the ?i max of 621 nm observed for HToc:p under 
similar'conditions. The £m of oc:HTP trimers was found to be 8.88xl0 5 M" 1 cm" 1 at 619 nm 
(Table 2), a slightly lower value than that observed for HTa:p. The excitation spectrum for 650 
nm emission corresponded nearly perfectly to the absorbance spectrum, indicating that the 
a:HTp preparation behaved as a single species with respect to spectroscopic properties. The 
A,was at 642 nm (with 560 nm excitation), with a <D F of 0.23 (Table 2). Thus the spectroscopic 
characteristics of oc:HTP are virtually identical to those of native Anabaena sp. PCC7 120 C- 
phycocyanin and its HTa:p counterpart. 

A sucrose density gradient preparation of phycobilisomes from Anabaena sp. 
PCC7120(pBS262) expressing HTp showed distribution of phycobiliprotein aggregates different 
from that of the wild-type cells. A much smaller proportion of the phycobiliprotein (25%) 
sedimented in the phycobilisome fraction, with 40% and 35% in the rod and hexamer fractions, 
respectively. SDS-PAGE analysis showed that HTp was present in all three fractions. However, 
less HXP (relative to the other phycobiliproteins) was found in phycobilisomes compared to the 
amount found in the products of phycobilisome dissociation, rods and hexamers, suggesting a 
reduced stability of phycobilisomes that have incorporated more HTp subunits. As described 
above fox Anabaena sp. PCC7120(pBS168) expressing HToc, His-tagged phycobilisomes could 
be isolated from the Anabaena sp. PCC7120(pBS262) phycobilisome fraction by MAC, 
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although in a relatively poor yield. The low yield might be due to the smaller number of HTp 
subunits present per phycobilisome, a reflection of the smaller amount of holo-HTp relative to 
holo-p produced per cell. 

Highly purified (HTa:(3)3 and (a:HTp)3 phycocyanins were used to measure the 
extinction coefficient of peptide-bound PCBs in 8 M urea pH 2.0. Two different preparations 
were measured for each protein, and four measurements gave very similar values. The molar 
extinction coefficient for one peptide-bound PCB at 660 nm is very close to the value of 35,400 
M 1 cm 11 (35). Upon subtraction of contributions from Trp and Tyr residues, the extinction 
coefficient for one peptide-bound PCB at 280 was determined to be (14.85±0.1)xl0 3 M _1 cm -1 
(Table 3). 

Expression ofHTa and HT$ in a cpcB AC background - The separate overexpression of 
HTct and HTp in phycocyanin-minus backgrounds was examined in mutant strain B646 (7). This 
mutant has a transposon inserted between the promoter and the CpcB open reading frame, which 
is expected to eliminate transcription of cpcBAC, genes encoding the P and a subunits of 
phycocyanin and a rod linker polypeptide, respectively (5). Strains B646(pBS168) and 
B646(pBS262) (see Table 1), generated by introduction of pBS168 and pBS262 into strain B646 
by conjugation, were phenotypically indistinguishable from the parent mutant strain. Both 
phycocyanin-deficient strains grow slowly and produce very low levels of the His-tagged 
polypeptides. 

SDS-PAGE analysis of the His-tagged phycobiliprotein fractions from strain 
B646(pBS262) showed no a or P subunits of wild-type phycocyanin. However, multiple attempts 
at purifying the HToc from strain B646(pBS168) yielded fractions containing some native P 
subunit, presumably resulting from a basal leaky transcription of the cpcBAC operon. 

On SEC-HPLC fractionation, the IMAC-purified proteins from B646(pBS262), An262- 
BAC, eluted as a single peak of subunit homodimers, i.e., (HTP)2. Purified native p subunits of 
C-phycocyanin dimerize at concentrations of 1-10 |jM (34, 50). Since mass spectrometry showed 
that the HTp subunits produced in this cpcBAC background carried a full complement of PCB, it 
is evident that bilin addition to phycocyanin p subunit is independent of the presence of the 
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phycocyanin apo- or holo-a subunit. The mass spectral analysis also indicates the presence in 
HTp of the posttranslational methylation at the y-amino group of the Asn 72 residue normally 
found in phycocyanin (5 subunits (51, 52). 

Under similar SEC-HPLC conditions MAC-purified protein preparation from 
5 B646(pBS168), Anl68-BAC, fractionated into three species: trimer, monomer, and subunit. 

SDS-PAGE analysis of the fractions indicated that the trimer and monomer fractions contained 
HToc and holo-p in equimolar amounts. The subunit fraction contained only HToc, the majority of 
which appeared to be apo-HTa based on the relative intensities in the absorption peaks at 280 nm 
and 618 nm, and from comparison of Zn 2+ - and Coomassie-staining intensities of the An 168- 

10 BAC trimer and monomer fractions (HTa:P) run in parallel on the same gel. 

Analyses of the His-tagged protein preparations from strains B646(pBS168) and 
B646(pBS262) indicate that overexpression of HTp in the cpcBAC background yields a soluble 
protein where the HTP subunit is stabilized by homodimer formation. The results also indicate 
that HTa appears to be unable to form subunit homodimers in the cell, and may require the 

15 presence of p subunit that is extremely limited in the cpcBAC background for stabilization. The 
greatly reduced chromophorylation of HTa protein isolated from this cpcBAC strain may also 
reflect a possible requirement of subunit dimerization (a:p, a:a, or p:p) prior to covalent 
attachment of PCB, or misfolding of a significant fraction of apo-HTa in the absence of apo- or 
holo-p. 

20 The HTa and HTp subunits isolated from mutant strains B646(pBS 168) and 

B646(pBS262) have absorbance spectra and molar extinction coefficients (Table 3) similar to 
those reported for renatured a and P subunits of native C-phycocyanin, obtained by 
chromatographic separation under denaturing conditions (34, 50). Algebraic addition of the HTa 
and HTP spectra yielded a spectrum with shape similar to that of monomeric PC but with a blue- 

25 shifted maximum at 61 1 nm, consistent with previous observations (34). Incubation of the HTa 
and HTP subunits together overnight at 4°C, followed by fractionation by SEC-HPLC, yielded a 
trimer fraction with absorbance and fluorescence excitation and emission spectra similar to the 
corresponding spectra of HTa:P , a:HTP, and native PC. This indicates that holo-HTa (but not 
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apo-HToc, see below) interacts preferentially with holo-HT(3 to form trimers, and His-tagged 
holophycocyanin subunits isolated in the absence of their cognate subunit partners can be 
reconstituted to yield trimers with properties similar to those of native phycocyanin. 

Expression of His-tagged phycocyanin subunits in an apophycocyanin a subunit PCB 
5 lyase-deficient background - A dimeric lyase, CpcEF encoded by genes cpcE and cpcF, 

specifically catalyzes the addition of PCB to Cys 84 of the phycocyanin a subunit (1,3). We have 
obtained two mutants through transposon mutagenesis, B64328 with an insertion in the cpcE 
gene, and B64407 in the cpcF gene. Both mutants had greatly reduced cellular content of normal 
phycocyanin (7). Expression of His-tagged phycocyanin subunits in the mutant strains provided a 

10 simple way of assessing how mutations in each of the two CpcEF lyase subunits affect PCB 
addition to the phycocyanin a subunit, and how cognate PC proteins behave in the cell. 

'Expression ofHTa in either a cpcE or a cpcF background - To express the HTa 
polypeptide in either a cpcE or cpcF background, plasmid pBS168 was introduced into both 
mutants, giving strains B64328(pBS168) and B64407(pBS168) (see Table 1). The two strains 

15 showed the yellowish-green phenotype characteristic of their respective parent mutants. The His- 
tagged phycobiliprotein fraction from either mutant strain amounted to < 5% of the total 
absorbance of the cell lysate at 607 nm. The His-tagged protein purified from the cpcF strain 
B64407(pBS168), Anl68-F, was shown by SDS-PAGE and mass spectral analyses to be apo- 
HToc:holo-p, with no detectable holo-HTcc. In contrast, the Anl68-E protein contained apo- 

20 HTa:holo-p along with a small amount of holo-HTa:holo-(3. On SEC-HPLC, apo-HTa:holo-(3 is 
seen to form monomers preferentially, in contrast to the preferred trimer formation by holo- 
HTa:holo-p. 

Sucrose density gradient sedimentation patterns of phycobilisome preparations from 
strains B64328(pBS168) and B64407(pBS168) were very similar to those given by the parental 
25 strains B64328 and B64407 (7). In both cases, the phycobilisome fraction consisted largely of the 
allophycocyanin-containing phycobilisome cores carrying much reduced amounts of rod 
components (relative to phycobilisomes from wild-type cells). SDS-PAGE analysis showed the 
presence of apo-HToc in the phycobilisome and rod/hexamer fractions and an amount of 
phycoerythrocyanin, a distal component of Anabaena phycobilisome rod substructures, greater 
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than that seen in wild-type phycobilisomes (7, 53). The relative amount of apo-HTa was higher 
in the phycobilisome fraction than in the rod/hexamer fraction, suggesting that apo-HTa 
assembled into phycobilisomes might be turned over more slowly in the cell. 

The finding that apo-HTa: holo-p assembles into phycobilisomes is consistent with earlier 
5 observations of such incorporation of the phycocyanin apo-a subunit in cpcE and cpcF mutants 
of the unicellular cyanobacterium Synechococcus sp. PCC7002 (3, 4). We showed previously 
that the Anabaena sp. PCC7120 mutant strains B64328 (cpcE) and B64407 (cpcF) incorporated 
the phycocyanin apo-a:holo-P into phycobilisomes, and that this "defective" phycocyanin still 
allowed the assembly into phycobilisomes of the distal phycoerythrocyanin component (7). The 

10 experiments reported here extend that observation to phycocyanin apo-HTa:holo-(3 proteins. 

Expression ofHTfi in either a cpcE or a cpcF background - To express the HT(i 
polypeptide in either a cpcE or cpcF background, plasmid pBS262 was introduced into both 
mutants, giving strains B64328(pBS262) and B64407(pBS262) (see Table 1). Yield of His- 
tagged holophycobiliproteins from either mutant strain was low, accounting for < 7% of the 

15 absorbance of the cell lysate at 607 nm. In contrast to results obtained with the mutant strains 
expressing HTa (see above), when HT(5 was expressed, SDS-PAGE analysis showed that the 
His-tagged protein preparations contained apo-a and holo-HTP in a molar ratio of approximately 
1:2. SEC-HPLC fractionation showed presence of apo-a:holo-HT(5 trimers and monomers, and 
of (holo-HTp)2 homodimers. The apo-a:holo-HTp trimers had spectroscopic properties virtually 

20 identical to those of apo-HTa: holo- (3 trimers described above. Analyses of phycobilisomes and 
of rod/hexamer fractions showed that some of the holo-HT(3 (likely in the form of apo-a: holo- 
HTP) was incorporated into both fractions. 

Expression of His-tagged mutant phycocyanin subunits - The favorable expression and 
incorporation of His-tagged wild-type phycocyanin subunits in Anabaena sp. provides a 

25 convenient means to perform in vivo and in vitro studies of mutant phycobiliprotein subunits. 
Here we present two examples. 

Expression of HTa A12T - One of the cpcA clones generated by PCR using the Taq DN A 
polymerase was found to have a single G -> A base change leading to an Ala 12 -> Thr mutation 
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in the translated protein. The Ala 12 residue is conserved in all sequenced phycocyanin a subunits 
(19). The crystal structure of phycocyanin predicts that replacement of Ala 12 by a residue with a 
larger side chain would interfere with oc(3 heterodimer formation by steric hindrance of the 
interaction of a-Asp 13 with p-Tyr 97 (54). We expressed the HTa A12T mutant subunit in wild- 
5 type Anabaena sp. PCC7120 to examine this prediction. 

The mutant cpcA gene was cloned into pBS150, giving plasmid pBS167. Anabaena sp. 
PCC7120(pBS167) expressing HToc A12T was phenotypically very similar to strain Anabaena sp. 
PCC7120(pBS168) (see Table 1). However, the His-tagged protein fraction was obtained in very 
poor yield from Anabaena sp. PCC7120(pBS167): < 0.15% of the phycobiliproteins in the cell 

10 lysate as estimated from A620 nm- While spectroscopically similar to the An 168 His-tagged 

protein preparation, mass spectral analysis of the An 167 His-tagged protein preparation showed 
two components in similar amounts, with masses corresponding to those of holo-oc and holo- 
HTa A12T . These results lead to two conclusions. First, the mutant HTa A12T subunit appears to 
have a greatly reduced affinity for the holo-P subunit. Most of the HTa A12T is likely to be present 

15 as free subunits in the cell and to be degraded rapidly. HTa A12T also seems to have a low affinity 
for holo-a, and is consequently stabilized in vivo by the formation of holo-a:holo-HTa A12T 
dimers. Second, the in vivo covalent attachment of PCB to the phycocyanin a subunit evidently 
does not require that it form a complex with the (3 subunit. 

Sucrose density gradient preparation of phycobilisomes from strain Anabaena sp. 

20 PCC7120(pBS167) gave a banding pattern similar to that from the wild type. The HToc A12T 

subunit was not detected in phycobilisome and rods fractions, suggesting that HTa A12T was not 
assembled into the phycobilisomes (although the very low amount of HToc A12T may have evaded 
detection). 

Expression of HT$ S46G > N76D - One of the cpcB fragments, PCR-amplified using the Tag 
25 DNA polymerase, contained two mutations: both an A -> G change, in bases number 139 and 
229, respectively, of the published sequence (5). The changes resulted in the replacement of 
residue Ser 46 with Gly, and of residue Asn 76 with Asp in the CpcB protein. Anabaena sp. 
PCC7120(pBS162) expressing the HTp S46G > N76D mutant PC subunit was phenotypically 
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identical to the one expressing His-tagged wild-type p subunit. The HTp S46G ' N76D mutant 
protein behaved the same way as the His-tagged wild-type counterpart in all aspects tested (yield, 
assembly states in vitro, assembly into phycobilisome in vivo, etc.) except that the purified 
protein, An 162, had an absorption maximum at 610 nm, about 8-nm blue-shifted compared to 
5 that of the An262 protein. Fluorescence emission maximum of the An 162 protein, however, was 
still at 642 nm. 

Ser 46 of the phycocyanin P subunit, situated in the coil region connecting helices A and B 
(39, 54), is not in contact with any chromophore. Its non-conserved substitutions include Gly in 
some phycobiliprotein subunits. It therefore may be assumed that the Ser 46 -> Gly change would 

10 not have obvious effects on the protein. The Asn 76 residue, on the other hand, is highly 

conserved among multi-chromophore p subunits of phycobiliproteins (19), and its sidechain is in 
contact with ring D of the cx-84 PCB chromophore of PC trimers, possibly involved in 
maintaining its stretched confirmation in trimer aggregates (55). The neighboring Arg 77 residue, 
also highly conserved, forms a hydrogen bond with the propionic acid substituent of ring B of the 

15 P-84 PCB (54, 55). In the mutant protein, replacement of Asn 76 by Asp 76 sidechain may interfere 
with that hydrogen bonding, thereby affecting absorbance of P-84 PCB. A reduction of 
absorbance of p-84 PCB would shift the X max of the cognate phycocyanin to the blue (50). 

Molecular modeling of His-tagged C-phycocyanin - Based on crystallographic data 
obtained from the cyanobacterium Fremyella diplosiphon C-phycocyanin (42) that is highly 

20 homologous to that of Anabaena sp. PCC7 120, a molecular model was built for the Anabaena 

sp. PCC7120 C-phycocyanin incorporating the 24-residue N-terminal extension. In this particular 
model, the N-terminal extension can be led from inside to outside of the ring structure of the 
phycocyanin trimer, through a groove between P subunits on the p side of the trimer. This allows 
unhindered a face-to-a face stacking of two trimers to form a hexamer. The 24-residue N- 

25 terminal tag, occupying only the groove space in the p faces, also does not appear to interfere 

with stacking of hexamers on p faces to form rods of the phycobilisome. The first N-terminal 13 
of the 24 residues of the tag, that include the six His residues, are completely exposed outside of 
the rod surface, consistent with the observed affinity purification of phycobilisomes by IMAC (as 
described above). 

28 B00-016-3 



In summary, methodology developed in this study allows (a) high level expression of 
holo-HToc and holo-HTp in Anabaena sp. PCC7120 and a variety of its mutant derivatives; (b) 
facile affinity purification of the His-tagged polypeptides and of complexes into which they are 
incorporated, including intact phycobilisomes; (c) study of in vivo assembly and bilin addition; 
and (d) analysis of site-specific mutants of C-phycocyanin. It is evident that the approaches 
described here are immediately generalizable to other Anabaena phycobiliproteins, such as 
allophycocyanin and phycoerythrocyanin (3, 7, 29). 

With appropriate choice of organism, these approaches can be applied to the expression 
and study of other phycobiliproteins including phycoerythrins. Finally, the studies described here 
provide for the design and expression of diverse fusion proteins in Anabaena sp. with important 
outcomes for the successful production of modular fluorescent phycobiliprotein tags and for the 
exploration of heterologous protein folding in cyanobacteria (9, 10). 

Example B. Recombinant phycobiliprotein fusion proteins with oligomerization and bio specific 
recognition domains. Here we describe the design and expression of more complex recombinant 
phycobiliprotein constructs which incorporate oligomerization and biospecific recognition 
domains and in some of which the a and p subunits are covalently bridged. Materials and 
methods essentially as described in Example A are not restated. 

Assays of protein binding to streptavidin - Western hybridization was carried out 
essentially as described (14). Proteins separated on SDS-PAGE were transferred to 
polyvinylidene difluoride (PVDF) membrane (Immobilon; Millipore Corp., Bedford, MA) using 
a mini Trans-Blot cell (BioRad Laboratories). Electrophoretic blotting was carried out in ice- 
cooled transfer buffer for 1 hr at 100 V. The PVDF membrane was then blocked with 3% bovine 
serum albumin in PBS buffer (115 mM NaCl, 4 mM KH 2 P0 4 , 16 mM Na 2 HP0 4 , pH 7.4) plus 
0.5% Tween-20 (Sigma Chemical Co.). Binding of membrane-bound proteins to streptavidin was 
assayed by incubating the membrane at 25 °C for 1 hr in PBS buffer plus 0.1% Tween-20 and 
l:5000-diluted streptavidin-alkaline phosphatase conjugate (Prozyme Inc., San Leandro, CA). 
Alkaline phosphatase-mediated color development on the membrane using chromogenic 
substrates 5-bromo-4-chloro-3-indolyl phosphate (BCIP) and nitro blue tetrazolium (NBT) was 
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carried out as described (14) except that 200 mM Tris-HCl pH 8.8 was used as the developing 
buffer. 

Alternatively, streptavidin-coated agarose beads were used to test binding of Strep2- 
tagged phycocyanin. As has been described (56), beads coated with recombinant core 
5 streptavidin (Biometra Inc., Tampa, FL) gave better results than those coated with natural 

streptavidin (Sigma Chemical Co. or Prozyme Inc.). Streptavidin-coated beads were incubated at 
4 °C for about 20 min with excess amount of purified Strep2-tagged phycocyanin, washed twice 
with buffer W (100 mM Tris-HCl pH 8.0, 1 mM EDTA), and visualized by fluorescence. 
Phycocyanin-labeled beads were observed either in batch with ultraviolet (>3 12 nm) excitation of 

10 fluorescence viewed through a 550 nm long-pass filter, or under an epifluorescence microscope 
(the BH-2 system from Olympus America Inc., Lake Success, NY) with excitation light through 
a 450-480 nm band-pass filter and emission observed through a 515 nm long-pass filter. 

Construction of modular cloning vectors for protein expression - A series of cloning and 
expression vectors was designed and constructed for expression of fusion proteins with different 

15 functional domains and tags on the N-terminus. To facilitate shuffling of the functional domains 
to produce desired combinations, DNA cassettes coding for these domains were designed as 
exchangeable modules. Care was taken in the design to ensure that only optimal codons (in 
respect to E. coli mdAnabaena sp. PCC7120) were used, and that no restriction sites (for 6-bp 
cutters) were present in the core sequences, except for designed-in signature sites. 

20 The expression vector pBS 150v [Example A; GenBank Accession number AF1 77932] 

was used as a template in the engineering of functional domains. Generally, a pair of designed 
oligonucleotide primers was used to run inverse PCR, producing a new plasmid in which the 
sequence from bp 3892 to 3938 (encoding the 6xHis affinity tag and a spacer) of pBS150v is 
replaced by a desired functional module. 

25 Replacement with the 56-bp NcoI-BspMIl Strep2 module generated plasmid pBS283v 

(4,644 bp; Table 4). The 10-residue Strep2 tag is able to bind specifically to streptavidin (57). 

Replacement with the 146-bp Ncol-Agel combination module (6xHis plus GCN4-pII) 
gave plasmid pBS31 lv (4,734 bp; Table 4), and with the 164-bp combination module (Strep2 
plus GCN4-pLI) gave pBS303v (4,752 bp; Table 4). Sequences for the GCN4-pH and GCN4-pLI 
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coiled coil domains were modified from part of the original Saccharomyces cerevisiae GCN4 
sequence encoding the leucine zipper domain (58, 59), to conform with the pll sequence 
(positions a and d are both He) or the pLI sequence (positions a and d are Leu and lie, 
respectively) (60). A BglU site was engineered into the GCN4-pH coding sequence as a signature 
5 site, as was an AsuR site in GCN4-pLL 

With plasmids pBS283, pBS303, and pBS31 1 in hand, four functional domains, 6xHis, 
Strep2, GCN4-pH, and GCN4-pLI, are available for further manipulation. Since the different 
functional domains are modular in design, different cloning and expression plasmids with 
various combinations of the modules were easily made by simple recombination cloning (Table 

10 4). All resulting plasmids have the modular region end with the second half of either the BspMR 
or the Agel site, encoding a Gly, immediately upstream from the recognition and cleavage 
sequence for the tobacco etch virus (TEV) endoprotease (Example A, herein, 23, 61). The 7- 
residue TEV site was retained in all plasmids to act as a spacer between the modular functional 
domains and the multiple cloning sites into which genes of interest are inserted, and to provide a 

15 means of removing the modular functional domains from expressed fusion proteins if so desired. 
The LacZ a domain downstream from the TEV site was found to be functional in all cognate 
plasmids, allowing blue/white screening of inserts in the multicloning region. The multicloning 
region of these plasmids can be greatly enhanced in lieu of the LacZ a function by recombination 
with plasmid pBS152v [(Example A, herein); GenBank Accession number AF177933]. For 

20 plasmids encoding phycocyanin subunits, the Ndel-cpcA-HindM and Ndel-cpcB-EcoRI 

fragments were taken from plasmids pBS185 and pBS251, respectively (Example A, herein). 

Construction of genes encoding phycocyanin subunit-fusion monomers - X-ray 
crystallographic data obtained from the highly homologous C-phycocyanin from another 
cyanobacterium Fremyella diplosiphon (42) were used to design subunit fusions of the Anabaena 

25 phycocyanin, see Example A. Straight-line distance between a carbons of the last residue of a 
subunit and the first residue of (3 subunit was measured to be 29.5 A. The distance between the 
last residue of p subunit and the first residue of a subunit was 32.6 A. Although the average sum 
of three chemical bonds' length of a peptide unit is 4.3 A, the actual straight-line distances 
between two a carbons were found to range from 3.2 to 3.7 A among several peptide units 
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measured in the crystal structure. An 1 1 -residue linker was therefore chosen to link the C- 
terminus of one subunit to the N-terminus of another. The 1 1 -residue linker is sufficiently long to 
bridge the distance (29.5 to 32.6 A) between the N- and C-termini of the two subunits, and 
provides some leeway for bending should this be required for higher-order assembly of the 
5 resultant fusion phycocyanins. 

One obvious difference between phycocyanins of Fremyella diplosiphon and of 
Anabaena sp. PCC7120 is the lack of the initial Met residue in the Anabaena subunits as a result 
of posttranslational modification (Example A, herein). To reduce possible steric interference in 
the engineered Anabaena protein, an Ala was inserted in place of the missing Met at the N- 

10 terminus of the subunit following the fusion linker. Since both a and p subunits have an oc- 

helical C-terminal region made up mostly of small hydrophobic residues, Ala was chosen as the 
first residue in the fusion linker to maintain the local hydrophobic environment. The rest of the 
linker peptide (LI 1) was designed for high structural flexibility, and consists mostly of Gly 
residues and two Ser. Incorporation of the signature site XmnI in the LI 1 coding sequence not 

15 only facilitates characterization of PCR products, but also allows easy construction of fusions of 
other proteins to the C-terminus of phycocyanin a or P subunits (Example C, herein). 

Designed oligonucleotide primers incorporating the LI 1 linker sequence were used for 
inverse PCR with template DNA consisting of both pBS185 and pBS251 plasmids (Example A ? 
herein), giving plasmids pBS310 (Ndel-cpcA-LU-cpcB-EcoRl in pUC19) and pBS307 (Ndel- 

20 cpcB-Ll 1-cpcA-HindHT). Once sequenced to confirm fidelity, the 1047 -bp Ndel-EcoRl fragment 
of pBS310 was cloned into Ndel- and £c6>RI-digested pBS150 (Example A, herein), giving 
plasmid pBS320 which encodes 6xHis-tagged CpcA-Ll 1-CpcB fusion phycocyanin. In like 
manner pBS315 was made to encode the 6xHis-tagged CpcB-Ll 1-CpcA fusion protein (Table 4). 
A PCR-generated spontaneous mutant product, in which 6 bases in the signature XmnI site of the 

25 LI 1 linker sequence had been deleted (giving the L9 linker), was also used in this study 
(pBS319;seeTable4). 

All other molecular biological manipulations followed standard procedures (14). To 
ensure sequence fidelity, all DNA fragments that had gone through the PCR process ( 16) were 
sequenced using the PRISM 373 DNA sequencing system and the dye-terminator cycle- 
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sequencing kit from Applied Biosystems (Foster City, CA). Analyses of DNA and amino-acid 
sequence data were performed with the programs Editbase (Purdue Research Foundation and 
USDA/ARS) and Lasergene (DNASTAR Inc., Madison, WI). 

Nomenclature - A recombinant polypeptide encoded by an expression plasmid is referred 
5 to by the plasmid number, and the His-tagged protein fraction isolated by affinity 

chromatography is designated by the organism in which the recombinant polypeptide is 
expressed. For example, a phycocyanin a subunit with both 6xHis and Strep2 tags encoded by 
plasmid pBS327, expressed in E. coli and purified by MAC is designated Ec327. The purified, 
characterized protein is then designated as HT-Strep2-oc. The same construct expressed in 

10 Anabaena sp. PCC7120 is designated An327. However, the purified protein is HT-Strep2-oc:(3, 
because in Anabaena sp. the HT-Strep2-a forms a stable complex with the holophycocyanin (3 
subunit. Phycobiliproteins expressed in E. coli are apoproteins (lacking attached phycobilins), 
whereas those expressed in Anabaena sp. are holoproteins, unless otherwise noted. 

Design of subunit-fusion phycocyanins - A His-tag at the N-terminus of either an a or a (3 

15 subunit does not interfere with the folding of these polypeptides into their native structures 
(Example A, herein). Two possible a/p fusions can be generated with an 1 1 -residue (L-l 1) 
linker: HToc-Ll 1-p and HTp-Ll 1-a. In either construct, one subunit, the a or the b, would have a 
24-amino acid extension (including the 6xHis tag) at the N-terminus and a large fusion, linked 
through LI 1, at its C-terminus. Both constructs were made to determine the folding and 

20 spectroscopic properties of such fusions. 

Expression of His-tagged phycocyanin (i- a fusion protein in E. coli - HTb-Ll 1-a is 
expressed in E. coli as the apoprotein (lacking the three phycocyanobilins). At 37 °C, in cultures 
induced with IPTG for 5 hrs, the fusion protein (Ec315) represented over 5% of total cellular 
proteins (>7.5 mg per liter of culture at 10 9 cells ml" 1 ), and at 30 °C, in cultures induced with 

25 IPTG for 12 hr, some 20% of total cellular proteins. HTp-Ll 1-a purified by MAC could be 

concentrated up to 0.2 mM protein (nearly 8 mg ml" 1 ) in 50 mM Tris-HCl pH 8.0, 50 mM NaCL 
50 mM KC1, 1 mM DTT, and 5% glycerol. The yield and solubility were markedly higher than 
that obtained upon expression of the individual His-tagged phycocyanin subunits, HToe (Ec 168) 
and HTp (Ec262), in E.coli (Example A, herein). With IPTG induction at 30 °C, in the latter 
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cases, yields were about 0.1 mg per liter of culture. Upon IPTG induction at 37 °C, -60% of the 

individually expressed 6xHis-tagged subunits remained in solution while the balance formed 

inclusion bodies. Thus coexpression of the apo-oe and apo-P subunits, inherent in the expression 

of HTP-L1 1-a, evidently promotes folding and concomitant retention of solubility of the 

recombinant apoprotein. 

Expression of His-tagged phycocyanin P-a fusion protein in Anabaena sp. - Cultures of 

Anabaena sp. PCC7120(pBS315) expressing HTp-Ll 1-a appeared more blue than that of the 

wild type. Whole-cell absorbance spectra showed a much higher ratio of 

phycocyanimchlorophyll a. Despite appearing very healthy, Anabaena sp. PCC7120(pBS3 15) 

cultures grew about 30% more slowly than the wild type. 
» 

When cell lysate supernatant from Anabaena sp. PCC7120(pBS315) was passed through 
a Ni 2 +-NTA affinity column, usually > 33% of the total phycobiliproteins (as estimated from 
^620 nm) was retained on the column and then eluted with imidazole (resultant protein denoted 
An315). Such high yields are comparable to that of the strain expressing HTa (> 30% of A 62 o nm )- 
and better than that of the strain expressing HTP (~ 16%) (Example A, herein). 

SDS-PAGE analysis showed that the An315 fraction consisted almost entirely of HTP- 
Ll 1-a, along with a small amount (<10%) of native phycocyanin a and p subunits in a - 1:1 
ratio. This finding suggests that the fusion construct interacts relatively normally with 
unmodified phycocyanin in the cell. Such interaction to form higher assemblies is supported by 
analysis of Anabaena sp. PCC7120(pBS315) phycobilisomes, which were indeed found to 
contain HTp-Ll 1-a. Much more of the fusion protein was found in the phycobilisome 
substructures (rods and hexamers/trimers) than in intact phycobilisomes, suggesting that 
phycobilisomes with a higher content of HTp-Ll 1-a may be less stable during the preparation 
procedure. The high amount of HTp-Ll 1-a relative to native phycobiliproteins in the 
hexamer/trimer fraction also suggests that the construct is turned over more slowly in the cell. In 
this context, it is noteworthy that the whole-cell phycobiliprotein fluorescence emission of 
Anabaena sp. PCC7120(pBS315) was twice as high as that of strains expressing His-tagged a or 
P subunit, and the growth media of this strain often turned blue, the color of phycocyanin. 
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On SEC-HPLC the An315 preparation fractionated into hexamers and trimers only, with 
no monomers observed (Table 5). This result parallels those obtained previously with An 168 
(HTa:p) and An262 (oc:HT(3), where the a and p subunits are not covalently linked (Example A, 
herein). The trimer component, (HT(3-L1 1-0O3, had a ?i max at 622 nm at >10~ 6 M protein. (HT(3- 
Ll I-0O3 dissociated to monomers at very low protein concentrations, with absorption maxima 
shifting blue to as low as 615 nm (Table 6), a behavior characteristic of native phycocyanin and 
of (HToc:(3)3 and (a:HTP)3 holoproteins. At high dilution, HTp-Ll 1-a has an A ma x- A360 nm of 
6.2, similar to the values observed for monomeric wild-type and His-tagged a:p phycocyanins 
(Table 6; Example A, herein). The £for (HTp-Ll I-0O3 at >10~ 6 M protein concentration was 
determined to be 900,000 M _1 cm* 1 . The fluorescence emission spectrum for 560 nm excitation 
of HTp-Ll 1-a, with ?iat 643 nm and a Of of 0.21, were virtually identical to those of wild-type 
phycocyanin. The fluorescence excitation spectrum for 655 nm emission corresponded well with 
the absorbance spectrum (Table 6). The above data show that the HTp-Ll 1-a fusion 
phycocyanin has properties virtually identical to those of the wild-type and His-tagged a:p holo- 
proteins, and indicate that HTp-Ll 1-a expressed in Anabaena sp. has a full complement of 
bilins, i.e., one PCB per a and two per P subunit. 

Mass spectral analysis of the SEC-HPLC trimer fraction, (HTp-Ll l-a)3, gave a value of 
40,993.2±6.8 which corresponds well to the theoretical value (41,1 18.8 Dal) of the holoprotein 
HTp-Ll 1-a with the initial Met residue (131.1 Dal) removed. The mass spectral analysis not 
only confirms the full posttranslational chromophorylation on both the a and p subunit moieties, 
but also indicates presence of the posttranslational methylation on the y-amino group of the 
Asn 72 residue of the P subunit domain (51, 52). 

Expression of His-tagged phycocyanin a-p fusion protein in Anabaena sp. - Like HTp- 
Ll 1-a apoprotein (Ec315), HTa-Ll 1-p (Ec320) was expressed in £. coli in very high yield in 
soluble form. The phenotype and growth rate of cultures of Anabaena sp. PCC7120(pBS320) 
expressing HTa-Ll 1~P were similar to those of strains expressing only His-tagged a or P 
subunits. The whole-cell absorbance spectrum of Anabaena sp. PCC7120(pBS320) was similar 
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to that of the wild type but with a slightly higher and blue-shifted contribution from phycocyanin, 
and abOut 50% higher whole-cell phycobiliprotein fluorescence was observed. 

Like HTp-Lll-oc (An315), HTa-Ll 1-P (An320) was also expressed at very high yield in 
Anabaena sp., often accounting for > 32% of total cellular phycobiliproteins. SDS-PAGE 
5 showed that affinity-purified An320 proteins were almost entirely HTa-Ll l-(3, with a small 
amount of copurified native phycocyanin a and P subunits. The ratio of copurified a to (3 
subunits was ^ 2, suggesting a reduced affinity of HTa-Ll 1-p for native phycocyanin (3 subunits. 

On SEC-HPLC, the An320 fraction separates into three components: monomers, trimers, 
and hexamers. It is noteworthy that native C-phycocyanin under the same chromatographic 

10 conditions was mainly trimeric with very little monomer, whereas the HTa-Ll 1-0 preparation 
was a - 1:1 mixture of trimers and monomers (Table 5). The theoretical molecular weights from 
these components are 13, 19, and 9% higher, respectively, than the values observed (Table 5). 
These discrepancies indicate that the radius of gyration of HTa-Ll 1-p is smaller than that of the 
wild-type phycocyanin ap monomer. 

15 At >10~ 6 M protein concentration, the SEC-HPLC trimer and monomer fractions of HTa- 

Ll 1-P had A, max at 608 and 605 nm, respectively. This likely accounts for the blue-shifted 
phycobiliprotein contribution to the whole-cell absorbance spectrum of Anabaena sp. 
PCC7120(pBS320). Like wild-type phycocyanin, the trimer fraction showed a blue shift in its 
absorbance spectrum down to 605 nm with decreasing protein concentration, indicative of 

20 dissociation of the trimers to monomers. At high dilution, HTa-Ll 1-P has an A max : A36O nm of 
5.2, substantially lower than the value of 6.4 observed for monomeric wild-type and HTa:P 
phycocyanins (Table 6; Example A). 

The unusual characteristics exhibited by the HTa-Ll 1-P protein purified from Anabaena 
sp. were similar to those of apo-HTa:p (Example A, herein). The trimer and monomer SEC 

25 fractions of HTa-Ll 1-p were therefore compared to the HTp-Ll 1-a holo-protein (both fusion 
proteins have identical amino acid composition). When denatured in acid urea, the trimeric 
fraction of HTa-Ll 1-P had PCB absorbance about midway between three PCBs per protein 
(holoprotein) and two PCBs per protein, while the monomeric fraction gave absorbance. very 
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close to that of two PCBs per protein. An interpretation of that result is that the a domain of 
HTa-Ll 1-P fusion protein is not fully chromophorylated. Only about 50% of the trimeric HTa- 
Ll 1-P proteins had PCB in the a domain, while most monomeric HToc-Ll 1-p proteins had no 
PCB addition in the a domain. Because of the nearly equal distribution between trimers and 
5 monomers, fewer than 30% of the a subunit domains of affinity-purified HTa-Ll 1-p proteins 
carry bilin. It is noteworthy that apo-HTa:holo-P and apo-a:holo-HTP proteins also run mostly 
as monomers on SEC-HPLC, with apparent molecular weights substantially smaller than 
theoretical calculations (Example A, herein). The fact that most of the HTa-Ll 1-P constructs 
have an apo-oc domain could also explain why more native a than native p phycocyanin subunits 

10 were copurified with HTa-Ll 1-p, since the a-Cys 84 bilin is involved in the a:P interaction and 
in stable monomer packing (55, 62) 

Electrospray mass spectral analysis of the SEC-HPLC monomer fraction of HTa-L 1 1 -P 
gave only one major peak with a value of 40,327. 8±5.7, corresponding well with the value of 
40,331.7 daltons, obtained by subtracting from 41,1 18.8 Dal (molecular weight of holo-HTa- 

15 LI 1-p) the values 131.1 (initial Met residue), 57.0 Dal (the C-terminal or the second residue 
from the N-terminus, Gly), 585.0 (one PCB), and 14.0 (one methyl group). This not only 
confirms the incomplete bilin addition (likely on the a domain) of HTa-Ll 1-p, but also suggests 
impaired posttranslational methylation on the y-amino group of Asn 72 of the p domain (51,52). 
The latter is unexpected because HTP (Example A, herein) and HTp-Ll 1-a (see above) purified 

20 from Anabaena sp. are both fully methylated on p-Asn 72 . 

The SEC-HPLC trimer fraction of HTa-Ll 1-p (at >10~ 6 M) had a Eof 677,000 M~ 1 cm- 1 . 
The fluorescence emission spectrum for 560 nm excitation, with ?iat 643 nm and a Of of 0.22, 
was virtually identical to that of wild-type phycocyanin. The fluorescence excitation spectrum for 
655 nm emission corresponded reasonably well with the absorbance spectrum, with a slight shift 

25 in the maximum to 613 nm, likely due to the small amount of native phycocyanin in the 

preparation (Table 6). The SEC-HPLC monomer fraction of HTa-Ll 1-p had spectroscopic 
properties similar to those of apo-HTa:holo-P monomers (Example A). 
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Effect of shortening the linker - A PCR-generated mutant, HTa-L9-p, with the linker Ala- 
(Gly)4-Ser-(Gly)3, was isolated in the course of producing the HToc-Ll 1-p construct, where LI 1 
is Ala-(Gly)4-Ser-Gly-Ser-(Gly)3. In other respects the two constructs were identical. Like the 
HTa-Ll 1-p apoprotein (Ec320), the HTa-L9-(3 apoprotein (Ec319) was expressed in E. coli in 
5 high yield. HTa-L9-p (An319) isolated from Anabaena sp. exhibited the same, anomalous 

spectroscopic properties described above for the HTa-Ll 1-p construct. Analysis of the HTa-L9- 
P protein by SEC-HPLC showed trimers and monomers in a ratio of 1:4, significantly lower than 
the ratio of 1: 1 observed for HTa-Ll 1-p. Since the extent of chromophorylation of trimer and 
monomer fractions of HToc-L9-p was similar to those of respective fractions of HTa-Ll 1-p, the 

10 reduced trimer formation by HTa-L9-p may be a result of conformation change induced by the 
shorter linker between the subunits. 

Expression of phycocyanin fusion proteins incorporating oligomerization domains - The 
dissociation of phycocyanins to the monomer at low protein concentration is highly undesirable 
when these proteins are used as fluorescent tags, particularly because of the resultant decrease in 

15 the number of PCB chromophores per tag. The finding that recombinant phycocyanin subunits 
with a 24-residue extension at the N-terminus fold and assemble in the same manner as native 
phycocyanin subunits (Example A, herein) suggested the introduction of the GCN4 
oligomerization domains (60) at the subunit N-termini to produce stable phycocyanin oligomers. 
Expression of phycocyanin a subunit fused at the N-terminus to the trimerization domain GCN4- 

20 pll - The 33-residue peptide GCN4-pH forms homotrimeric parallel coiled coils with Kd < 1 0" 9 
M (60). Plasmid pBS314 (encoding HT-pH-a) was constructed to express a GCN4-pII-CpcA 
fusion (Table 4). 

When HT-pH-a was expressed in E.coli, the recombinant protein was found almost 
entirely in inclusion bodies. Similar results were observed whether the induction was at 30 or 37 
25 °C. This behavior is in sharp contrast to that seen with HTa [Ecl68; (Example A, herein)], where 
most of the recombinant protein remains soluble. A likely explanation is that trimerization of the 
GCN4-pH in the fusion protein is faster than the folding of the phycocyanin subunit domain and 
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interferes with folding of the latter by promoting random aggregation of the partially folded 
subunit domains. 

Cultures of Anabaena sp. PCC7120(pBS314) expressing HT-pH-a showed no negative 
phenotype, but had a bluer appearance than the wild type. Whole-cell absorbance spectra of 
5 Anabaena sp. PCC7 120(pBS3 14) showed a higher ratio of phycocyaninxhlorophyll a. Whole- 
cell fluorescence was -70% higher than that of the wild type or cells expressing HToc. Sucrose 
density gradient fractionation of Anabaena sp. PCC7120(pBS314) phycobilisome preparations 
showed that HT-pH-a represented a much higher proportion of the total phycocyanin in the 
trimer/hexamer fraction than in the intact phycobilisome fraction. 

10 Over 40% of the phycobiliprotein in Anabaena sp. PCC7 1 20(pBS3 14) cell lysate 

supernatant bound to the Ni 2 +-NTA column and eluted with 200 mM imidazole. This was the 
highest relative level of recombinant phycobiliprotein of any of the His-tagged constructs we 
have expressed in the course of these studies. An314 bound much more tightly to the Ni 2+ -NTA 
column than His-tagged phycocyanin constructs not containing oligomerization domains. The 

15 tightness of binding was evidenced by the slow desorption of the recombinant protein from the 
Ni 2+ -NTA resin and the larger volume of imidazole buffer required for elution. Such behavior is 
to be anticipated from a molecule in which three polypeptides, each with a 6xHis tag, are 
trimerized through the GCN4-pH coiled coil. SDS-PAGE analysis showed that the An3 14 
phycobiliprotein fraction was a stoichiometric HT-pn-a:(5 holoprotein complex. Chromatography 

20 on SEC-HPLC showed that the HT-pH-a: p preparation was >75% trimer with some higher 
molecular weight component(s), presumably hexamer. Monomers were not detected. 

The spectroscopic properties of (HT-pII-a:p) 3 are compared with those of the native C- 
phycocyanin trimer, (oc:p) 3 in Table 6. The trimerized construct has significantly higher 
A ma x-A 36 o nm ratio than the native (a:P) 3 and a higher <J>p. Moreover, the absorbance spectrum of 

25 (HT-pn-a:P) 3 was unchanged at low protein concentrations where native phycocyanin, HTa:p, 
and a:HTp were monomeric with significantly blue-shifted absorbance maxima and decreased 
Amax^A3 6 onm ratios (Table 6; Example A, herein). Fluorescence polarization measurements 
provided an independent assessment of the stability of (HT-pII-a:p) 3 at low protein 
concentrations. In trimeric phycocyanin, the fluorescence polarization is very low because of 
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rapid energy transfer among the nine phycocyanobilin chromophores. In contrast, in the 
phycocyanin monomer, the three PCBs are well separated and depolarization through energy 
transfer is minimized. Our data show that at high protein concentrations, where various 
phycocyanin preparations are trimeric, their fluorescence polarization is similar and low, with 
5 values of 0.035 to 0.070. The fluorescence polarization of HTa:(3 and a:HT(3, like that of native 
phycocyanin, rose sharply at very low protein concentrations, indicative of trimer dissociation, 
whereas that of (HT-pn-a:p) 3 was independent of protein concentration. 

Expression of phycocyanin a subunit fused at the N-terminus to the tetramerization 
domain GCN4-pLI -The 33-residue peptide GCN4-pLI forms very stable homotetrameric 

10 parallel coiled coils (60). Several plasmids were constructed to express GCN4-pLI-oc fusion 

proteins (Table 4), including plasmids pBS321 (encoding HT-pLI-a) and pBS323 (encoding HT- 
Strep2-pLI-oc). The 10-residue Strep2 tag was incorporated into the HT-Strep2-pLI-a to test 
whether it would confer streptavidin-binding specificity on the construct. 

When expressed in E. coli, GCN4-pLI-oc fusion proteins, Ec321 (HT-pLI-a) and Ec323 

15 (HT-Strep2-pLI-a), were mostly found in inclusion bodies. In contrast, when expressed in 

Anabaena sp. the fusion proteins An321 and An323 remained soluble. The yields of these His- 
tagged phycobiliproteins from this strain were high, reaching > 20% of total phycobiliprotein. As 
described above for the GCN4-pH-a fusion proteins, the GCN4-pLI-a fusion protein (An32 1) 
bound very tightly to the Ni 2 +-NTA resin. SDS-PAGE analysis showed that An321 protein 

20 fraction purified by IMAC had the composition of HT-pLI-oc:(3. Analysis by SEC-HPLC showed 
two components, a small amount of tetramer, (HT-pLI-oc:p) 4 , and the balance as a much larger 
component. The elution position of the latter lay outside the range of calibration of the size 
standards for the column, but extrapolation of the calibration curve allowed an estimate of the 
size of the larger component. The calculated molecular weight was consistent with that of a 

25 trimer of tetramers (Table 5). 

Presumably, such a trimer would be formed by the interaction of three (HT-pLI-oc:p) 4 
assemblies through one of their phycocyanin a:p domains (while the other three ocp domains 
forming a trimer) in a manner analogous to that leading to the formation of (HT-pII-a:p) v In 
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such a "trimer of tetramers", all a:p monomers are within phycocyanin trimers. Tetramers, each 
held together by a GCN4-pLI domain, would be the expected products of dissociation of such a 
"trimer of tetramers" at very low protein concentration. In accord with this expectation, the 
absorbance spectrum of HT-pLI-cc:p at low protein concentration showed a A, max of 621 nm and 
an A max :A 360 nm ratio of 7.5 (Table 6), values very similar to those obtained for (HT-pII-a:p)3 
(see above). 

HT-Strep2-pLI-oc:p (An323) exhibited physical and spectroscopic properties identical to 
those of HT-pLI-oc:p (An321) (Table 6), showing that the presence of the Strep2 sequence does 
not interfere with tetramerization by the GCN4-pLI domain. 

Expression of subunit-fusion phycocyanins fused at the N-terminus to the trimerization 
domain GCN4-pII - As described above, His-tagged P-a subunit-fusion phycocyanin (An3 15) 
has spectroscopic properties identical to those of HToc:Pand a:HTp. Like those proteins, 
however, HT-p-Ll 1-a also dissociates to monomers at high dilution. To increase utility of the 
HT-p-Ll 1-a fusion phycocyanin as a fluorescent label, we fused the GCN4-pII domain to this 
construct to enhance trimer stability. 

Plasmid pBS358 encodes HT-pII-p-Ll 1-a (Table 4). When expressed in E. coli, HT-pII- 
P-Ll 1-a apoprotein was found mostly in inclusion bodies, but when expressed in Anabaena sp. 
the HT-pH-P-Ll 1-a protein remained soluble. Cultures of Anabaena sp. PCC7120(pBS358) had 
phenotypes very similar to those of the Anabaena sp. PCC7120(pBS3l5) (see above), including 
the slower growth, "bluer" color, and increased whole-cell fluorescence. Analysis of 
phycobilisome preparations from this strain showed that HT-pH-P-Ll 1-a was incorporated into 
the phycobilisomes. The 6xHis-tagged phycobiliprotein fraction (An358) purified by IMAC 
represented >22% of total cellular phycobiliproteins. As noted above for An314 (HT-pII-a:P), 
the An358 fraction also bound very tightly to the Ni2+-NTA resin. 

SDS-PAGE analysis of the An358 fraction showed that it consisted >90% of HT-pII-P- 
Ll 1-a along with a small amount of copurified wild-type phycocyanin a and p subunits (in a 
ratio of 1:1). Analysis by SEC-HPLC at protein concentrations <500 fig/ml, showed that HT-pII- 
P-Ll 1-a preparation was largely trimeric with a small amount of a larger component (Table 5). 
Comparison of the spectroscopic properties of (HT-pH-p-Ll l-)3, purified by SEC-HPLC, with 
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those of native C-phycocyanin trimer (ap)3, showed no significant differences (Table 6). The 
absorbance spectrum of (HT-pH-b-Ll l-a)3 did not change with dilution to very low protein 
concentration (Table 6), indicating that the GCN4-pH domain prevented dissociation of the 
trimer. 

5 The a-p fusion constructs purified from Anabaena sp., HTa-L9-P and HTa-Ll 1-p, 

preferentially form monomers even at micromolar protein concentrations (see above). The 
similar construct incorporating the GCN4-pH domain, HT-pH-a-Ll 1-p, is encoded by plasmid 
pBS362 (Table 4). The HT-pH-a-Ll 1-p protein purified from Anabaena sp., An362, was found 
to be largely trimeric (84%) with the remainder hexameric (Table 5). No monomer was observed 

10 in the SEC-HPLC analysis. It is evident that the trimeric state of HT-pH-a-Ll 1-p depends on 
trimerization of the GCN4-pII domain. Although remaining trimeric at low protein 
concentrations, the HT-pH-a-Ll 1-P protein had spectroscopic properties similar to those of 
HTa-Ll 1-p protein (Table 6), including the incomplete chromophorylation of the a domain. 
Expression of recombinant phycocyanins with a tag conferring biospecificity - The 

15 constructs described above successfully address the requirement that the phycocyanin constructs 
maintain their oligomeric state at very low proteins concentrations. Next, we examine the 
introduction of an additional domain into such constructs, one which confers biospecific 
recognition. As a test case, we introduced a tag which binds specifically to streptavidin. 
Recombinant phycocyanins containing such a tag are immediately useful as reagents in the many 

20 applications of the widely used streptavidin-biotin methodologies (63). 

Expression of Strep2-tagged phycocyanin constructs - The 10-residue Strep2 peptide 
forms complex with streptavidin with a Kd of 7.2xl0- 5 M (57). It is believed that the Step2 tag 
retains its affinity for streptavidin when fused either to an N- or C-terminus of a protein (57). 
Plasmids pBS327 and pBS323 were constructed to encode HT-Strep2-a and HT-Strep2-pLI-a, 

25 respectively (Table 4). E. coli and Anabaena strains expressing either fusion protein were 
phenotypically identical to those expressing the corresponding non-Strep2-tagged proteins. 
Affinity-purified HT-Strep2-a:p (An327) and HT-Strep2-pLI-a:p (An323) proteins also had 
spectroscopic and aggregation properties very similar to corresponding non-Strep2-tagged 
proteins HTa:p (An 168) and HT-pLI-a:p (An321) (Tables 5 and 6). 
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When blotted on a membrane, HT-Strep2-oc (Ec327 and An327 and HT-Strep2-pLI-a 
(Ec323 and An323) were readily detected by probing with streptavidin-alkaline phosphatase 
conjugate, indicating that the Strep2 peptide sandwiched between the 6xHis tag and the GCN4- 
pLI domain still retains its affinity for streptavidin. When streptavidin-coated agarose beads were 
5 used as target, HT-Strep2-a:|3 holoprotein was largely removed from the beads on washing, a 

result anticipated from the low affinity of a single Strep2 tag for streptavidin [Kd = 7.2x 10" 5 M; 
(57)]. The HT-Strep2-pLI-oc:|3 holoprotein, with its multiple Strep2 tags, bound much more 
strongly to the streptavidin-coated beads. This experiment demonstrates that oligomeric 
phycobiliprotein fusions with multiple Strep2 tags show the appropriate target specificity in 

10 streptavidin-based fluorescence assays. 

Example C. Recombinant phycobiliprotein fusion proteins with carboxyl-terminal tagging. 

We present here the in vivo production of biotinylated phycocyanin constructs that are 
readily usable in the many well-developed biotin/avidin applications (63). Materials and methods 
essentially as described in Example A or B are not restated. 

15 Construction of expression plasmids - Plasmids coding for relevant protein fusion 

constructs are listed in Table 7. The expression vector pBS150v [(Example A); GenBank 
Accession number AF177932] was used to make plasmid pBS339v (4,677 bp) for expression of 
His-tagged proteins capable of being biotinylated in £. coli. The 7 codons between the 6xHis tag 
and theTEV site in pBS150v were replaced with the 21 codons specifying the BTN tag. The 1 3- 

20 residue sequence of the BTN tag was derived from the consensus sequence that can be 

biotinylated in E. coli (64). An AsuU restriction site was designed into the BTN sequence as a 
signature site to facilitate PCR product characterization and subsequent recognition of the BTN 
tag sequence. The Ndel-cpcA-HindlR fragment from pBS185 (Example A) was inserted between 
Ndel and Hindm sites of pBS339v, giving plasmid pBS329v (5,138 bp) encoding 6xHis- and 

25 BTN-tagged phycocyanin a subunit (Table 7). The BTN tag is positioned on the carboxyl- 
terminal side of the 6xHis tag, so that any recombinant molecules that have lost the biospecificity 
tag through proteolysis are not retained on the IMAC column. The Ndel-glbN-HindEI fragment 
from plasmid pGlbN (66) was similarly cloned to give plasmid pBS355, encoding 6xHis- and 
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BTN-tagged cyanoglobin (Table 7). The same Ndel-HindUl fragment cloned into pBS 150v gave 
plasmid pBS121v (5,005 bp) encoding 6xHis-tagged cyanoglobin (Table 7). 

The cloning vector pBS370v (4,589 bp) for expression of C-terminal Strep tagged 
proteins (Table 7) was derived from the cloning vector pBS152v [(Example A); GenBank 
5 accession No. AF 177933] using inverse PCR. The 45-bp HindUl-Bgll fragment at the end of the 
multiple cloning sites was replaced with the 68-bp HindlR-Strep-Bgll sequence. An Ehel site was 
engineered into the Strep tag coding sequence (56) as a signature site. Phycocyanin genes, in the 
form of Ndel-Xmnl fragments from cognate plasmids (Example B) were inserted between Ndel 
and Xmnl sites of pBS370v to generate phycocyanin-Strep tag fusions. The expression cloning 

10 vector pBS350v, an enhanced version of pBS 152v, was created by replacing bp 3,80 1 to 4,070 of 
pBS152v with the 333-bp sequence described above. 

DNA fragment encoding the 1 14- AA C-terminal portion of the Anabaena sp. PCC7 1 20 
BCCP protein (BCCP1 14) was amplified from Anabaena genomic DNA. The 0.36-kb PCR 
fragment was digested with EcoTlll and cloned into the EcoTlll site of pBS350v, giving 

15 plasmid pBS344v (4,977 bp). Again, phycocyanin genes, in the form of Ndel-Xmnl fragments 
from cognate plasmids (Example B), were inserted between Ndel and Xmnl sites of pBS344v to 
generate phycocyanin-BCCPl 14 fusion constructs with the BTN flexible linker. The 20-residue 
flexible linker between the two moieties was designed to (a) bear a thrombin recognition and 
cleavage site to allow separation of the fusion partners if so desired, and (b) be sufficiently long 

20 to allow packing of BCCP1 14 on the outside of the rod substructures after the fusion protein is 

assembled in phycobilisomes (Example A, Example B). The GCN4-pII trimerization domain was 
introduced via modular cloning (Example B) between the 6xHis tag and the TEV protease site N' 
of the phycocyanin genes. 

Expression of BTN-tagged fusion proteins - The 13-residue BTN tag has the consensus 

25 sequence of peptides found to be biotinylated in E. coli (64). Covalent attachment of a biotin to 
the Lys residue is apparently catalyzed by the biotin ligase encoded by the birA gene (65, 67). 
The BTN tag was fused to the N-terminus of the phycocyanin a subunit in an attempt to produce 
in vivo biotinylated phycocyanin. When the HT-BTN-a phycocyanin subunit, encoded by 
plasmid pBS329, was expressed in E. coli, the purified Ec329 protein was shown biotinylated in 
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Western analysis. The same protein produced in Anabaena sp. PCC7120 (An329), however, was 
very poorly biotinylated, if at all, even when the culture medium had been supplemented with as 
much as 500 jxM biotin. Two other assays for the presence of biotin on An329, competition with 
HABA {2-[(4-hydroxyphenyl)-azo]benzoic acid} for streptavidin in solution (68) and mobility 
5 shift in SEC-HPLC, indicated lack of biotinylation. Parallel results were obtained in a 
comparison of Ec317 (HT-BTN-pH-a) and An317 (HT-BTN-pII~a:P) proteins. 

Sucrose density gradient experiments showed that both HT-BTN-oc (An329) and HT- 
BTN-pH-a (An317) proteins were assembled into the phycobilisomes. To test whether the lack 
of An317 and An329 biotinylation in vivo was due to the fact that the recombinant phycocyanins 

10 were quickly assembled into the phycobilisome upon biosynthesis and therefore not available to 
the biotin ligase, the cyanogiobin (GlbN) protein from cyanobacterium Nostoc commune (66) 
was used. This myoglobin-like monomeric hemoprotein has a specific subcellular location 
around the periphery of the cytosolic face of the cell membrane (69), but is not membrane-bound. 
His-tagged cyanoglobins HT-GlbN and HT-BTN-GlbN (encoded by plasmids pBS 121 and 

15 pBS355, respectively; Table 7) expressed in both E. coli and Anabaena sp. were found to have 
absorbance spectra nearly identical to that described for the native GlbN (70). However, while 
the Ec355 protein was found to be biotinylated, the An355 protein was not. The negative results 
for biotinylation in Anabaena sp. of all three different proteins tested, An317, An329, and 
An355, suggest that the BTN tag is not recognized by the Anabaena biotin ligase. 

20 Analysis of the IMAC-purified An329 proteins by SDS-PAGE incated the composition 

HT-BTN-oc: p. The An329 SEC-HPLC elution profile and spectroscopic characteristics were 
virtually identical to the non-BTN-tagged counterpart Anl68 (HToc:p; Table 8). The yield, 
however, was surprisingly low: An329 recovered by MAC was <3% of total cellular 
phycobiliprotein, over 10-fold less than obtained for Anl68. This was also the case for An3 1.7 

25 compared to its non-BTN-tagged counterpart An314 [HT-pII-a:P; (Example B)]. The yield of 
An355 was also dramatically lower than that of An 121. These results suggest that the BTN tag 
destabilizes the tag-bearing proteins in Anabaena cells. Since An317 and An329 are assembled 
into the phycobilisomes, the latter may also be destabilized. Higher turnover of phycobilisomes 
and of An329 proteins in the cell may explain the observed phenotype of lower cellular 
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phycobiliproteins in the strains Anabaena sp. PCC7120(pBS317) and Anabaena sp. 
PCC7120(pBS329). 

Expression of truncated Anabaena BCCP114 protein - Since the BTN tag was not 
biotinylated in Anabaena sp., naturally occurring biotinylated proteins were investigated. A 
truncated gene encoding the C-terminal 1 14 residues of the Anabaena BCCP protein, covering 
the biotinylation domain (corresponding to the E. coli BCCP84 that is sufficient for enzyme- 
catalyzed biotinylation) and a large portion of the flexible linker, was amplified from the genome 
and cloned into pBS350, giving plasmid pBS344 (Table 7). Plasmid pBS344 encodes the 6xHis- 
tagged Anabaena BCCP1 14 with a long linker bearing the TEV and thrombin protease sites. 

E. coli cultures expressing the 6xHis-tagged Anabaena BCCP1 14 protein (Ec344) grew 
normally, and produced very soluble Ec344 protein at >10 mg per liter of culture grown and 
induced at 37 °C. The protein was shown biotinylated by Western analysis. Electrospray mass 
spectroscopy analysis suggested that about 20% of the Ec344 protein produced under these 
particular conditions was biotinylated. Growth and induction at 30 °C for longer period of time 
was attempted to increase the percentage of biotinylated holoprotein fraction of purified Ec344. 
However under these conditions, the yield of affinity purified protein was greatly decreased, 
possibly as a result of lowered production and increased degradation of the Ec344 protein. 

Cultures of Anabaena sp. PCC7120(pBS344) expressing the An344 protein were fairly 
healthy, albeit having a slightly reduced level of total cellular phycobiliproteins. The yield of 
An344 protein, however, was quite low (usually < 1 mg per liter of dense culture), possibly as a 
result of proteolytic clipping of the relatively long linker between the 6xHis tag and the 
BCCP1 14 domain. The An344 protein was shown to be biotinylated in Western analysis. 
Electrospray mass spectral analysis indicated that about 40% of the molecules were holo- {i.e., 
biotinylated) proteins, substantially higher than the fraction of those produced in E. coli. These 
results encouraged the study of C-terminal phycocyanin a and p fusions with BCCP 1 14. 

Expression of phycocyanin (3 subunit-BCCPl 14 fusion proteins - Plasmid pBS353 
encodes the fusion protein HTp-BCCPl 14 (Table 7). Overexpression of Ec353 in E. coli gave 
high yield of the protein in soluble form. Since expression of Ec262 (HT(3) under similar 
conditions leads to substantial formation of inclusion bodies (Example A), the BCCP1 14 domain 
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appears to enhance the solubility and folding of the fusion protein. The HTp-BCCPl 14 
preparation from E. coli was found biotinylated in Western analysis. 

Cultures of Anabaena sp. PCC7120(pBS353) generally had an unhealthy, yellowish green 
appearance, and the growth rate was at least 20% slower than that of the wild type even under 
high-light conditions. The yellowish green color of the cultures results from -50% reduction of 
cellular phycocyanin as indicated by whole-cell absorption spectra. Upon IPTG induction of 
overproduction of the HTp-BCCPl 14 protein, cultures in late-log growth frequently stopped 
growing, and further incubation of the culture often led to general cell lysis. This unexpected 
result, which was also observed with the HTa-BCCPl 14 construct (An351; see below), led to the 
abandonment of IPTG induction of Anabaena sp. PCC7120(pBS353) cultures. Expression of the 
HTp-BCCPl 14 protein, therefore, was left to the leaky trc promoter (Example A), and generally 
resulted in a fairly low yield of An353 (< 5% of total cellular phycobiliproteins). Nonetheless, 
very high cell densities could be obtained with these uninduced cultures. 

The HTP-BCCP114 protein was incorporated into phycobilisomes, with no obvious 
destabilizing effects. On SDS-PAGE analysis, however, the affinity-purified An353 protein had a 
substantially larger amount of the HT|3-BCCP1 14 subunit than the partner phycocyanin holo-a 
subunit, indicating a lowered stability of a:HTp-BCCPl 14 as compared with that of a:HTp. 
Also, a small fraction of the HTp-BCCPl 14 preparation had lost the BCCP1 14 moiety by 
proteolysis. 

On SEC-HPLC the An353 preparation separated into two major components. The larger 
component, with a retention time of about 12.4 min, had an apparent mass of 183.7 kDal, 
somewhat larger than 162.2 kDal calculated for the trimer, (oc:HTp-BCCPl 14) 3 . On SDS-PAGE 
this larger fraction had only phycocyanin a and HT(3-BCCP1 14 subunits, in nearly 1 : 1 ratio. 
Upon exposure to Zn 2+ , fluorescence from the HTP-BCCP1 14 band under UV illumination was 
about twice as bright as that from the phycocyanin holo-a subunit, suggesting full bilin content 
(two PCBs per p) of the recombinant phycocyanin P subunit. The smaller component, eluted at 
-14.4 min, contained mostly HTP-BCCP1 14, with an apparent molecular weight corresponding 
to that of the homodimer, (HTp-BCCPl 14)2. While this result is in accord with the observation 
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that phycocyanin HT(3 subunits can form stable homodimers (Example A), it also suggests that 
the HTp-BCCPl 14 subunits have a lowered affinity for the phycocyanin a subunit, and may lose 
some of the a subunits during affinity purification. 

Polypeptides in the HTp-BCCPl 14 preparation lacking the BCCP1 14 portion represent a 
5 higher proportion of the fraction containing (HTp-BCCPl 14) 2 than of the (a:HTp-BCCP 1 14) 3 

fraction. Such proteolytic removal of the BCCP114 domain of Ec353 was not observed in E. coll. 
The degradation observed in the preparations from Anabaena cells may therefore be the result of 
the much longer time the fusion protein An353 remains inside Anabaena cells, and may also 
reflect higher activity of Anabaena sp. proteases towards the long, thrombin site-bearing linker. 

10 spectroscopic properties of the trimeric fraction, (a:HTP-BCCPl 14)3, were similar to 

those of (a:HTp) 3 (Table 8). The slightly blue-shifted Amax of (a:HTP-BCCPl 14) 3 and the lower 
Amax- A360nm ratio may reflect incomplete bilin addition to the p subunit, with a consequent 
perturbed conformation and lowered trimer stability in a portion of the preparation. 
Western analysis showed that HTp-BCCPl 14 produced in Anabaena sp. was 

15 biotinylated. Binding experiments with monomeric avidin-coated beads were performed to 
further assess the utility of An353 proteins as labels. When avidin molecules were in excess, 
-30% of the An353 preparation could be immobilized on the beads and then specifically eluted, 
indicating a level of HTP-BCCP1 14 biotinylation in Anabaena sp. of about 30%. When an 
excess of An353 was used, saturation by the recombinant phycocyanin of nearly all of the biotin- 

20 binding sites on the avidin-coated beads was achieved. With appropriate excitation, the stained 
beads emitted brilliant red fluorescence. 

Expression of HT$-BCCP 114 fusion protein bearing the GCN4-pII trimerization domain 
- Only about a half of the HTP-BCCP1 14 expressed in Anabaena retained the phycocyanin a 
subunit during purification. We had noted in earlier studies (Example B) that phycocyanin fusion 

25 proteins with GCN4-pH trimerization domains remained trimeric even at very low protein 
concentrations and contained both subunits in stoichiometric amounts. This is most likely 
attributable to the greatly increased local concentration of subunits. 
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HT-pH-p-BCCPl 14, encoded by plasmid pBS359 (Table 7), was expressed at high level 
in E. coli. About 25% of Ec359 could be isolated in soluble form, much more than seen with pll- 
phycocyanin subunit fusion constructs (Example B), again showing the increase in solubility 
attributable to the presence of the BCCP1 14 domain (see above). 
5 Cultures of Anabaena sp. PCC7120(pBS359) were yellowish, similar to those of 

Anabaena sp. PCC7120(pBS353), with an even slower growth rate. However, such cultures grew 
to very high cell densities after IPTG induction. 

The HT-pH-p-BCCPl 14 protein was assembled into phycobilisomes with no obvious 
destabilizing effect. In contrast to An353, affinity-purified An359 protein preparation had the 

10 HT-pII-P-BCCPl 14 subunit and the phycocyanin holo-a subunit in a 1 : 1 ratio. The yield of 

An359 was surprisingly low, generally <2% of total cellular phycobiliproteins. Almost no HT- 
pll-p lacking the BCCP1 14 moiety was observed, suggesting that in the stable trimers, (a: HT- 
pH-p-BCCPl 14)3, the long linker between P and BCCP1 14 is shielded from proteases. 

On SEC-HPLC, the An359 protein preparation migrates as a component with an apparent 

15 mass of 475.2 kDal, significantly higher than the 350.3 kDal calculated for a hexamer, [(ot:HT- 
pH-p-BCCPl 14)3]2. A phycocyanin hexamer is normally formed by face-to-face stacking of two 
trimers (42). A model of the (a:HT-pII-P-BCCPl 14)3 trimer reveals that a hexamer could form 
by two trimers stacking on the BCCP1 14 face, with the BCCP1 14 domains on the outside of the 
trimer rings. Such a hexamer would have a larger radius of gyration and exhibit a higher apparent 

20 mass on SEC. It should be noted that at very low protein concentration the hexamer is expected 
to dissociate into trimers. Spectroscopic properties of (a:HTp-BCCPl 14)3 (Table 8), were very 
similar to those of GCN4-pII-bearing constructs such as (HT-pE-a:p)3. 

Western analysis showed that HT-pII-p-BCCP114 obtained from Anabaena sp. was 
biotinylated. The utility of this construct as a fluorescent label was explored in binding 

25 experiments with avidin-coated beads. With avidin in excess, about 75% of (a:HT-pH-P- 
BCCP1 14)3 w as immobilized on the beads and then specifically eluted off. The extent of 
biotinylation of the BCCP1 14 domain in HT-pH-P-BCCPl 14 proteins is presumably -30%, 
similar to that seen in HTP-BCCP1 14 (An353). The higher binding percentage is anticipated, 
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since only one of the three BCCP1 14 domains needs to be biotinylated for the entire trimer to 
bind to the beads. When an excess amount of (a:HT-pD-p-BCCPl 14)3 was used, the amount of 
a:HT-pH-p-BCCPl 14 monomer equivalents immobilized was nearly 2.5-fold that of monomelic 
avidins. Under phycocyanin excitation, (a:HT-pII-P-BCCP114)3-stained avidin- and 
5 streptavidin-coated beads were highly fluorescent. 

Expression of phycocyanin-BCCP 114 fusion protein with covalently bridged a and (3 
subunits - Constructs with covalently bridged a and p subunits provide a way of ensuring 1 : 1 
a:P stoichiometry, as demonstrated earlier with the HTa-Ll 1-p construct (Example B). HTa- 
Ll 1-P-BCCP1 14 encoded by plasmid pBS361 (Table 7) was expressed well in E. coli and 

10 acceptably in Anabaena sp. 

HTa-Ll 1-P-BCCP1 14 protein was assembled into the phycobilisome. As with HTa-Ll 1- 
P (Example B), such phycobilisomes were less stable. The affinity-purified An361 fraction 
represented -10% of total cellular phycobiliprotein and consisted >90% of HTa-Ll 1-P- 
BCCP114. A small amount of native phycocyanin a and p subunits copurified with HTa-Ll 1-p- 

15 BCCP1 14, indicating that the a and p subunits in the fusion construct retained their ability to 

interact with native subunits. A small fraction of the HTa-Ll 1-p-BCCPl 14 proteins had lost the 
BCCP1 14 portion by proteolysis. 

SEC-HPLC fractionation showed that the An361 preparation consisted of trimers and 
monomers in an -1: 1 ratio. Spectroscopic properties of the trimers, (HTa-Ll 1-P-BCCP1 14)3, 

20 were found very similar to those of (HTa-Ll l-p)3 (Table 8), indicating that carboxyl terminal 

attachment to BCCP1 14 did not perturb the phycocyanin subunit domains. In a related construct, 
involving a carboxyl terminal fusion of the a subunit, (HTa-Ll l-p)3, bilin addition to the a 
subunit domain was shown to be very incomplete (Example B). Since (HTa-Ll l-p)3 and (HTa- 
Ll 1-p-BCCPl 14)3 have virtually identical spectroscopic properties (Table 8), bilin addition to 

25 the a subunit domain of (HTa-Ll 1-p-BCCPl 14)3 is presumably also incomplete. 

t The HTa-Ll 1-p-BCCPl 14 protein preparation from Anabaena sp. was shown to be 
biotinylated in Western analysis. In binding experiments with streptavidin- and avidin-coated 
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beads, the An361 preparation gave results similar to those obtained with HT(3-BCCP1 14 (see 
above), indicating -30% biotinylation of the BCCP domain of HTa-Ll 1-p-BCCPl 14. 

Expression of phycocyanin-BCCP 114 fusion protein with covalently bridged a and P 
subunits and the GCN4-plI trimerization domain - Plasmid pBS365 was constructed to encode 
5 the HT-pH-a-Ll 1-p-BCCPl 14 (Table 7). Unlike Ec362 (HT-pH-a-Ll 1-p) which is found almost 
entirely in inclusion bodies (Example B), Ec365 expressed in E. coli remained soluble, again 
indicating the solubility-promoting effect of the BCCP1 14 domain. 

HT-pH-a-Ll 1-p-BCCPl 14, expressed in Anabaena sp. PCC7120(pBS365), was found to 
be assembled into the phycobilisome. The yield of affinity-purified An365 was >5% of total 
10 cellular phycobiliprotein. The An365 preparation consisted of >90% HT-pH-a-Ll 1-p-BCCPl 14 
with a small amount of copurifying native phycocyanin a and P subunits. Virtually no loss of the 
BCCP1 14 moiety was observed. 

On SEC-HPLC, the An365 preparation was found to consist mostly of hexamers, with a 
small fraction of trimers. (HT-pH-a-Ll 1-P-BCCP1 14)3 had spectroscopic properties similar to 
15 those of (HT-pH-a-Ll l-p)3 (Table 8), again consistent with incomplete bilin addition to the 
phycocyanin a domain (Example B) . 

The An365 from Anabaena sp. was shown to be biotinylated in Western analysis and 
gave results in binding experiments with monomeric avidin-coated beads quantitatively similar 
to those obtained with An359, (a:HT-pH-P-BCCP114)3. Streptavidin-coated beads stained with 
20 excess of An365 appeared blue and emitted brilliant red fluorescence under appropriate 
excitation. 

.Expression of phycocyanin a subunit-BCCP 114 fusion proteins - Recombinant 
phycocyanin a subunits, with the amino-terminus fused to polypeptides of varying length, 
display unmodified bilin content and spectroscopic properties (Example A, Example B). As 
25 noted above, fusions at the carboxyl terminus of the a subunit interfered with bilin addition. To 
examine whether this behavior is specific to particular C-terminal fusions or is general, four 
phycocyanin a-BCCPl 14 fusion constructs analogous to the P-BCCP fusions described above 
were prepared (Table 7). 
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Expression of the four constructs, Ec351, Ec357, Ec360, and Ec364, in E. coli gave 
results very similar to those observed with the four corresponding phycocyanin (3-BCCP1 14 
constructs (see above). Expression of the four a-BCCPl 14 constructs in Anabaena sp., however, 
gave very different results. 

An35 1 (HTa-BCCPl 14) represented < 0.5% of total cellular phycobiliprotein. SDS- 
PAGE showed that the HToc-BCCPl 14 polypeptide had a very low bilin content, indicating that 
the BCCP1 14 extension on the a subunit greatly interferes with bilin addition. The low yield is 
likely attributable to the ease of proteolysis of the apo-HTa-BCCPl 14 polypeptide. 
The An351 preparation contained little phycocyanin P subunit, suggesting reduced affinity to the 
apo-HTa-BCCPl 14. The proteolytic cleavage of the long peptide linker between a and 
BCCP1 14 was also high in the apo-HTa-BCCPl 14 polypeptide. An351 preparations often 
contained a substantial amount of degradation product of HTa-BCCPl 14 in which the BCCP1 14 
moiety had been clipped off. The extremely low yield of An351 also suggests that the protein is 
rapidly turned over in the cell. 

Results from expression of An357 (Table 7) were nearly identical to those from An35 1 , 
except that the native phycocyanin p subunit was copurified with the (mostly apo-) HT-pH-a- 
BCCP1 14 subunit in nearly 1:1 ratio. Results from expression of An360 (HTP-L1 1-a-BCCPl 14) 
and An364 (HT-pH-p-Ll 1-a-BCCPl 14) were very similar. The yield was -5% of total cellular 
phycobiliprotein, substantially higher than that obtained from An351 and An357. A large 
percentage of the purified fusion protein, however, had lost the BCCP1 14 moiety. These results 
showed that interference with bilin addition to the phycocyanin a-subunit domain was a feature 
common to all of these different BCCP1 14 fusion constructs. 

Expression of phycocyanin a subunit-Strep tag fusion proteins - To address the question 
of the possible dependence of bilin addition to the phycocyanin a subunit on the size of the 
carboxyl-terminal fusion partner, such fusions were prepared with the 10-residue Strep tag (Table 
7). 

The yield of the His-tagged protein preparation, An386, horn Anabaena sp. 
PCC7120(pBS386) expressing HTa-Strep was <1% of total cellular phycobiliprotein (in great 
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contrast to >30% usually obtained from cells expressing HToc). On SEC-HPLC the An386 
preparation runs almost entirely as monomer, with a very small amount of trimers. The monomer 
peak contained HTa-Strep and native phycocyanin (3 in nearly 1:1 ratio. While the (3 subunit was 
the normal holoprotein, the HTa-Strep subunits were almost entirely apo-subunits. The apo- 
5 HTa-Strep appeared to have lowered affinity for the phycocyanin (3 subunit, as indicated by the 
observation that fractions collected from the trailing part of the monomer peak on SEC-HPLC 
were enriched in the apo-HTa-Strep subunit relative to the holo-(3 subunit. The spectroscopic 
properties of the HTa-Strep:(3 monomer were very similar to those of the apo-HTa:holo-(3 
monomer (Table 8; Example A), including the characteristic broad peak in the fluorescence 

10 excitation spectra. Thus, even a short extension on the carboxyl terminus of phycocyanin a 
subunit affects the posttranslational bilin addition. 

Expression of phycocyanin fusion protein with covalently bridged a and (3 subunits and 
the Strep tag - In a final attempt to address the problem of incomplete bilin addition in carboxyl 
terminal fusions of the phycocyanin a-subunit, we examined behavior of HT(3-L1 1-a-Strep 

15 fusions (encoded by plasmid pBS394; Table 7). The yield of affinity-purified An394 was ~ 10% 
of total cellular phycobiliprotein. Unlike the An315 preparation (from cells expressing HT[3-L1 1 - 
a) that consists mostly of trimers, the An394 proteins is mostly monomeric, with small amounts 
of trimeric and hexameric components. Proteins in the higher assembly states had a higher bilin 
content, with those in the hexamer fraction approaching three PCBs per HT(3-L1 1-a-Strep 

20 molecule, i.e., holoproteins. Proteins in the monomer fraction on average contained two PCBs 
per HTJ3-L1 1-a-Strep molecule, most likely lacking the a subunit bilin. In total, <10% of the 
polypeptides in the An394 preparation carried bilin on the a subunit domain, a small but 
significant improvement over the HTa-Strep: P construct. Most An394 preparation could be 
immobilized on streptavidin-coated beads, showing that the Strep tag in the HTp-Ll 1-a-Strep 

25 molecules has retained its affinity for streptavidin and is available for binding. 

TABLE 1 

Properties of Anabaena sp. PCC7120 and derivative strains and of expression vectors 
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carried by these strains relevant to the production of His-tagged C-phycocyanin a and (3 subunits 



20 



Strain and vectors 


Strain genomic 


Polypeptide construct 




characteristics 


encoded on vector 


PCC7120(pBS168) 


Wild-type 


HTa 


PCC7120(pBS262) 


Wild-type 


HT(5 


B646(pBS168) 


cpcBAC b has a transposon inserted 


HTa 




between the promoter and the cpcB 






open reading frame 




B646(pBS262) 


cpcBAC 


HT(3 


B64328(pBS168) 


cpcE c inactivated by transposon 


HTa 




insertion 




B64328(pBS262) 


cpcE 


HT(3 


B64407(pBS168) 


cpcF c inactivated by transposon 


HTa 




insertion 




B64407(pBS262) 


cpcF 


HT(3 


PCC7120(pBS167) 


Wild-type 


HTa A,2T 


PCC7120(pBS162) 


Wild-type 


HT(3 S46G ' N76D 



a The expression vectors, specified in parentheses, are derived from pBS 150v and pBS 150 as 
described in the text. 

25 b Genes cpcB and cpcA encode the (3 and a subunits of C-phycocyanin and cpcC encodes a linker 
polypeptide involved in the assembly of trimers (ccp)3 and hexamers [(apbh and their assembly 
into phycobilisomes (5, 6). 
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c cpcE and cpcF encode two subunits of a heterodimeric phycocyanin a subunit lyase required for 
the addition of phycocyanobilin to the apophycocyanin a subunit (1, 2-4). 

TABLE 2 



Molecqlar weights of His-tagged phycocyanins in different aggregation states, as determined by 
size exclusion chromatography and by analytical ultracentrifugation. 



Protein" 


SEC-HPLC 


AUC 


Major 


Aggregation 


Expected 




(kDa) 


(kDa) 


Components 


state 


MW (kDa) 


Native PC 


211.7 




a: (3 


Hexamer 


225.7 




124.7 






Trimer 


112.8 




44.5 






Monomer 


37.6 


Anl68 


301.6 




HTa:p 


Hexamer 


243.0 




145.3 


125.9 




Trimer 


121.5 




49.2 


34.8 




Monomer 


40.5 


An262 


136.9 


121.9 


a:HT(3 


Trimer 


121.5 




62.4 


43.4 




Monomer 


40.5 


Anl68-BAC 


289.3 




HTa:p 


Trimer 


243.0 




137.2 






Monomer 


121.5 




17.3 




HTa 


Subunit monomer 


20.9 


An262-BAC 


50.3 




HTp 


Subunit homodimer 


45.0 


Anl68-E/F 


106.8 




Apo-HTa:p 


Trimer 


119.7 




29.8 






Monomer 


39.9 


An262-E/F 


54.1 




HTp 


Subunit homodimer 


45.0 




33.8 




Apo-a:HTp 


Monomer 


39.9 



a "-BAC" suffix indicates proteins isolated from cpcBAC mutant strains of Anabaena sp., and "- 
E" and M -F" indicate proteins isolated from cpcE and cpcF mutant strains of Anabaena sp., 
respectively. 

TABLE 3 

Spectroscopic properties of His-tagged phycocyanins a . 
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Sample 


Abs max 
(nm) 


E x max 


Em max 
(nm) 


£m (IVHcnr 1 ) 
(xl 000) 




Native PC (ct:p)3 


615-618 


615 


644 


929 + 2.1 


0.27 ± 0.06 


(HTa:p) 3 


615-621 


618 


642 


921 ±36 


0.22 ± 0.05 


(a:HTp) 3 


615-619 


616 


642 


888 ± 33 


0.23 ± 0.04 


HTa 


618-622 


615 


637 


109 ± 2.5 


0.23 ± 0.02 


(HTP) 2 


605 


606 


639 


365± 13 


0.22 ±0.03 


apo-HTa:P 


607 


606 


642 


182 ±1.4 


0.24 ± 0.00 


One protein- 


660 b 






35.40 


0 


bound PCB 


280 






14.85 ±0.1 





a Absorbance maxima (Abs max ) are for protein concentrations between 0.05 and 3 jiM. Lower 

concentrations yield blue-shifted spectra in all cases where a range of wavelengths is given. 

Excitation and emission maxima (Ex max and Em max ), and quantum yields of fluorescence (Of) 
« 

are for samples at ^0.05 |LiM. Molar extinction coefficients (£ m ) were determined for the 
15 following complexes purified by SEC-HPLC : native PC, trimer fraction; HToc:p' trimer fraction 
of Anl68; a:HT(3, trimer fraction of An262; HTa, subunit monomer fraction of Anl68-BAC; 
HTp, subunit homodimer fraction of An262-BAC; apo-HTa:p; monomer fraction of Anl68-F. 

b Absorbance of protein-bound phycocyanobilin was measured with SEC-HPLC purified 
20 (HTot:p)3 and (oc:HTp)3 holoproteins, denatured in 8 M urea pH 2.0. The molar extinction 

coefficient at 660 nm is identical to that measured in 7.2 M urea pH 2.0, 9 mM DTT (35), while 
that at 280 nm was calculated by subtracting contributions from Tyr [£ = 1,370 M" 1 cm 1 ; (36)] 
and Trp [£ = 5,500 M* 1 cm" 1 ; (37)] residues. 

25 TABLE 4 

Expression plasmids with specific functional domains in the encoded protein 



Plasmid a Affinity Biospec. Coiled coil Protease Core 
tag tag b domain site protein 
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pBS311 


6xHis 




GCN4pn 


TEV 


LacZ« 


pBS314 


6xHis 




GCN4pH 


TEV 


CpcA 


pBS319 


6xHis 






TEV 


CdcA-L9-CdcB 


DBS320 


6xHis 






TEV 


CdcA-L1 1-CdcB 


PBS362 


6xHis 




GCN4oII 


TEV 


CdcA-L1 1-CdcB 


pBS315 


6xHis 






TEV 


CpcB-Ll 1-CdcA 


pBS358 


6xHis 




GCN4pH 


TEV 


CpcB-Ll 1-CpcA 


pBS283 




Strep2 




TEV 


LacZ a 


pBS342 


6xHis 


Strep2 




TEV 


LacZ a 


pBS327 


6xHis 


Strep2 




TEV 


CpcA 


prJo jUj 




otrepz 


VJd\l4p.Ll 


Itiv 




pBS309 


6xHis 


Strep2 


GCN4pLI 


TEV 


LacZ« 


pBS323 


6xHis 


Strep2 


GCN4pLI 


TEV 


CpcA 


pBS312 


6xHis 




GCN4pLI 


TEV 


LacZ« 


pBS321 


6xHis 




GCN4pLI 


TEV 


CpcA 



a Like the parent plasmids pBS150v and pBS150, all plasmids listed have two versions: the 
20 smaller one, indicated in the text with a suffix V, is more suitable for cloning manipulations, 
but is lacking the 3.7-kb pDUlHC fragment that enables autonomous plasmid replication in 
Anabaena sp. PCC7120 (Example A, herein). 

b The biospecificity tag is positioned on the carboxyl-terminal side of the 6xHis tag, so that any 
25 recombinant molecules that have lost the biospecificity tag through proteolysis are not retained 
on the IMAC column. The Strep2 tag is a 10-residue peptide with specific affinity for 
streptavidin. 

TABLE 5 
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Determination of apparent molecular weight of components in recombinant phycocyanin 
preparations by SEC-HPLO 

Protein Calc. mass Percent Assembly Theoretical 
(kDa) area state b mass (kDa) c 





" Anl68 


49.2 


5 


Monomer 


40.5 


5 


(HTa:p) 


145.3 


95 


Trimer 


12L5 




Aniz/ 


AA ^ 
44.Z 


4 


Monomer 


A 1 A 

41.4 




(HT-Strep2-a:(3) 


137.9 


94 


Trimer 


124.3 






275.4 


2 


Hexamer 


248.7 


10 


An314 


110.2 


79 


Trimer 


132.7 




(HT- P n-a:p) 


406.0 4 


21 


(Hexamer)2 


530.7 




An321 


232.8 


13 


Tetramer 


176.9 




(HT-pLI-a:P) 


499.5<! 


81 


(Tetramer)3 


530.7 


15 














An323 


264.0 


6 


Tptrampr 


182 7 




(HT-Strep2-pLI-a:p) 


429.3 d 


94 


(Tetramer)3 


548.2 






JJ.O 


7Q 


ivionomer 


*f 1 .u 


20 


(HTa-L9-p) 


99.3 


19 


Trimer 


122.9 






232.6 


2 


Hexamer 


245.8 




Anjzu 


1A 1 

J4. 1 


AO) 

49 


Monomer 


A 1 1 

41.1 




(HTa-Lll-p) 


89.7 


46 


Trimer 


123.4 






213.6 


5 


Hexamer 


246.7 


25 


An362 


128.9 


84 


Trimer 


134.5 




(HT-pH-a-Lll-p) 


304.7d 


15 


Hexamer 


269.1 




An315 


107.9 


76 


Trimer 


123.4 




(HTp-Lll-a) 


235.7 


23 


Hexamer 


246.7 
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An358 115.6 64 Trimer 134.5 

(HT-pn-p-Lll-ot) 587.2 d 35 (Hexamer) 2 538.0 

a For experimental conditions, see text. 

b By convention, the a: (J heterodimer is referred to as a phycocyanin "monomer." 
c Values are for holo-proteins. 

d Apparent molecular weight outside the range defined by the calibration standards. 



Spectroscopic properties of phycocyanin constructs 21 



Sample b Initial X max c A max / Ex max Em max £m <D f 

assembly state (nm) A360 nm (nm) (nm) -h 

Denatured a-j} 280 -- -- - 44,550 d ~ 

15 PC 660 1.0 -- -- ..19.6.200 0 

Native PC (a:p)3 615-619 6.4 615 644' 929,666 0.27 

An 168 (HTa:p) 3 615-621 6.4 618 642 921,000 0.22 

An327 (HT-Strep2- 615-619 6.4 618 642 922,000 0.23 

a:p)3 

An262 (g:HT(3)3 615-619 6.3 616 642 888,000 0.23 
20 An314 (HT-pn-arpb 621-622 7.5 622 642 927,000 6.39 

An321 (HT-pLI- 621-623 7.5 621 643 1,231,000 0.28 

a:f3) 4 

An323 (HT-Strep2- 621-623 7.2 621 644 1,240,000 0.27 
pLI-a:[3)4 

An3i'5 (OTpXTl'-a)3 615-622 £2 622 643 mOOO 0.21 

An358 (HT-pH-p- 619-621 6.7 620 643 915,000 0.29 

LII-0O3 

25 An320 (HTa-Lll-p) 3 605-608 5.2 613 643 677,000 e 0.22 

An362 (HT-pH-a- 607-610 5.8 615 642 707,000 f 0.24 

L"-P)3 , 
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a Trimer or tetramer components obtained by SEC-HPLC were used for all 
measurements. A ma x- A360 nm values, excitation maxima (Ex max ), emission maxima 
(Em ma x)> and quantum yields of fluorescence (Of) are given for measurements at protein 
concentrations <0.05 mM. Em values are given for measurements on the components 
specified in column 2 at protein concentrations >1 \iM. 

b Properties of proteins not described in this study are taken from Example A, herein. 

c ^ m ax values were determined at protein concentrations from 0.05 to 3 |LiM. Where the 
spectra shift to the blue with decreasing protein concentration, the limits of A, max values 
are those measured at the lowest and highest concentrations, respectively. 

d Absorbance from the three protein-bound PCBs only, excluding contribution from Tyr and Tip 
residues (Example A, herein). 
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TABLE 7 

List of fusion constructs indicating order of domains 



Fusion construct Plasmid 



6xHis-BTN-TEV-LacZ a ' p BS339 

6xHis-BTN-TEV-CpcA pBS329 

6xHis-BTN-pn-TEV-CpcA pBS3 1 7 

6xHis-BTN-TEV-GIbN pBS355 

6xHis-TEV-GlbN pBS121 



6xHis-TEV-TBN-BCCP114 pBS344 

6xHis-TEV-CpcB-TBN-BCCP114 pBS353 

6xHis-GCN4pn-TEV-CpcB-TBN-BCCP114 pBS359 

6xHis-TEV-CpcA-Ll 1-CpcB-TBN-BCCPl 14 pBS361 

6xHis-GCN4pn-TEV-CpcA-Lll-CpcB-TBN-BCCP114 pBS365 

6xHis-TEV-CpcA-TBN-BCCP114 pBS351 

6xHis-GCN4pn-TEV-CpcA-TBN-BCCP114 pBS357 

6xHis-TEV-CpcB-Lll-CpcA-TBN-BCCP114 pBS360 

6xHis-GCN4pn-TEV-CpcB-Lll-CpcA-TBN-BCCP114 pBS364 

6xHis-TEV-Strep PBS370 

6xHis-TEV-CpcA-Strep PBS386 

6xHis-TEV-CpcB-Lll-CpcA-Strep P BS394 



3 Functional domains in each construct are listed in order from N- to C-terminus, with 
particularly relevant domains in bold face. CpcA and CpcB correspond to the a and (3 subunits of 
phycocyanin, respectively. 
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b All plasmids have two versions. The smaller one, indicated in the text with a suffix V, is for 
cloning and expression in E. coli, and lacks the 3.7-kb pDUlHC fragment that enables 
autonomous plasmid replication in Anabaena sp. PCC7120 (Example A). 

5 TABLE 8 
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a Properties of constructs not described in this studies are taken from the accompanying studies 
(Example A, Example B). 

b Affinity-purified proteins were futher purified by SEC-HPLC, and collected fractions in 50 mM 
Na-phosphate pH 7.0 buffer ± 1 mM NaN3 were used for all measurements. A ma x- A360 nm 
5 values, excitation maxima (Ex ma x)> emission maxima (Em max ), and quantum yields of 

fluorescence (Of) are given for measurements at protein concentrations <0.05 \xM, Em values 
are given for measurements on the components specified in column 2 at protein concentrations 
>1 mM. 

c ^max values were determined at protein concentrations from 0.05 to 3 |xM. Where the spectra 
10 shift to the blue with decreasing protein concentration, the ^ m ax values are those measured at the 
lowest and highest concentrations, respectively. 

d Absorbance from the three protein-bound PCBs only, corrected for contributions from Tyr and 
Tip residues (Example A). 

e The phycocyanin a subunit domain has an average PCB content of -0.5. 
15 f The phycocyanin a subunit domain has an average PCB content of -0.25. 
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FURTHER EXAMPLES 

1. Phvcobiliproteins carrying the GCN4-pII or the GCN4-pLI coiled-coil oligomerization 
domain. These fusion proteins were found almost entirely in inclusion bodies in E. coli, but 
found incorporated in the phycobilisome in Anabaena sp. These constructs, as well as other 
examples discussed below, all show that proteins bearing the GCN4 oligomerization domains are 
found almost entirely in inclusion bodies when expressed in E. coli, but are incorporated in 
phycobilisomes when expressed in Anabaena sp. Upon phycobilisome dissociation, good yields 
of soluble, fluorescently tagged proteins are obtained. 

2. Streptavidin-phvcocvanin a subunit fusion proteins. The core streptavidin (stvC) gene used 
here encodes a StvC corresponding to residues 16 to 133 of the mature streptavidin. 
Recombinant StvC with a 24-residue N-terminal extension bearing a 6xHis tag and the TEV site. 
HT-StvC (encoded by plasmid pBS282), was expressed relatively poorly in E. coli, with about 
80% of the proteins found in inclusion bodies. The StvC-CpcA fusion protein has an 1 1 -residue 
bridge linking the two moieties. HT-StvC-CpcA (encoded by plasmid pBS292) was produced at 
relatively high level in E. coli, but was found only in inclusion bodies. 

The fusion protein An292 was well expressed in Anabaena sp. Ni 2+ -NTA-purified 
soluble' An292 protein accounted for >15% of total cellular phycobiliproteins. The An292 protein 
was of HT-StvC-a:p in composition, ran on SEC-HPLC as tetramer (HT-StvC-a:P)4 and trimer 
of tetramers [(HT-StvC-a:P)4]3, similar to the finding with An321 with phycocyanin a subunit 
bearing the tetramerization domain GCN4-pLI: (HT-pLI-a:P)4 and [(HT-pLI-cc:p)4]3. An292 
also had spectroscopic properties virtually identical to those of An321, making it an excellent 
fluorescent label. The An292 protein was assembled into phycobilisomes and also found to bind 
biotin. 

Like streptavidin tetramers, the (HT-StvC-a:p)4 tetramer was much more stable than the 
(HT-pLI-a:p)4 tetramer. On room temperature denaturing SDS-PAGE, (HT-StvC-a:P)4 



66 



B00-016-3 



released only the p subunits and retained the (HT-StvC-oc)4 aggregate. The tetramer could be 
broken only by boiling the protein before loading on SDS-PAGE. Purified phycobilisomes 
displaying StvC had a tendency to aggregate out of solution, likely a direct result of inter- 
phycobilisome linkage mediated by tetramerization of the monomeric StvC domains displayed 
on PBS. Usually within 48 hrs all such phycobilisomes precipitated out of solution. 
3. Protein A-phvcocvanin a subunit fusion proteins. 

Expression of truncated protein A (SpA) in the cytosol of E. coli has been shown to give 
very poor yield, with particularly severe proteolysis occuring to the hinge of domains A and B. In 
our construct, the truncated SpA protein [denoted SpA(DABC)] contains the hinge of domain E, 
domains D, A, and B in entirety, and domain C lacking a small portion of its hinge. Domains D. 
A, B and C all retain their IgG-binding structure. Plasrnid pBS356 encodes the recombinant SpA 
with a C-terminal 6xHis tag extending from the residual hinge region of domain C: SpA(DABC)- 
6xHis. The protein, Ec356, was obtained in extremely poor yield when expressed in E. coli, 
regardless whether the purification method was denaturing (using guanidine HC1; see e.g., 
Example B, supra) or not, and almost no full-length Ec356 protein was seen on SDS-PAGE. 
Recombinant protein A with engineered hinge regions has been described for better expression of 
the SpA in E.coli . Without such extensive reengineering, E. coli is not an appropriate organism 
for expression and production of recombinant protein A. 

In experiments described here, the SpA(DABC) protein is fused at its C-terminus to the 
phycocyanin a subunit through a 24-residue linker bearing a 6xHis tag and a recognition and 
cleavage site for the specific TEV endoprotease. The fusion protein, SpA(DABC)-6xHis-TEV- 
CpcA, encoded by plasrnid pBS349, was obtained in very poor yield when expressed in Exoli, 
likely a result of extensive proteolysis. Little full-length Ec349 protein was seen, with most of the 
Ni2+-NTA-purified protein having some or all of the IgG-binding domains missing. 

When expressed in Anabaena sp., the fusion protein An349 was found assembled into 
phycobilisomes with no apparent destabilizing effect on the light-harvesting complex. Cellular 
phycobiliprotein level relative to chlorophyll a, however, was about 20% lower, suggesting an 
elevated phycobiliprotein turnover. Nonetheless, growth rate of cultures under high-light 
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intensity was not affected. Ni2+-NTA-purified An349 proteins were obtained in relatively low 
yield of about 5% of total cellular phycobiliproteins. 

On SEC-HPLC, the An349 proteins fractionated into two components: >80% as the 
trimer (SpA-6xHis-a:P)3 and the rest being SpA-6xHis-a subunit. SDS-PAGE showed that the 
SpA moiety of the majority of the fusion protein had also been proteolyzed, giving full-length 
SpA(DABC)-6xHis-a (49.5 kDal; ca. 10%), SpA(ABC)-6xHis-oc (41.5 kDal; ca. 20%), and 
SpA(BC)-6xHis-ot (35.5 kDal; ca. 70%). No proteolytic cleavage on domain B hinge [giving 
SpA(C)-6xHis-a] and domain C hinge (giving 6xHis-oc) was observed. The lack of proteolytic 
cleavage on the domain B hinge is a clear indication of different protease activities in Anabaena 
sp. because that hinge region has been found to be the most susceptible to proteolytic cleavage in 
E. coli. The lack of clipping in the domain C hinge is likely due to its proximity to the 
phycobilisome, minimizing access by proteases. This hinge region is susceptible to proteolytic 
activities in Anabaena sp. when displayed farther away from the phycobilisome surface (see 
below). 

SDS-PAGE also showed that the carrier phycocyanin a domain had the normal bilin 
content. The SEC-HPLC trimer (SpA-6xHis-a:B)3 fraction had spectroscopic properties very 
similar to those of (6xHis-a:B)3, with absorbance maximum at 616 nm, an A616 nm:A360 nm 
ratio of 6.1, fluorescence excitation maximum at 618 nm (for 650 nm emission), and 
fluorescence emission maximum at 641 nm (for 560 nm excitation). 

An349 proteins were tested for their ability to bind IgG in ELIS A. Serially diluted An349 
protein was immobilized on a 96-well plate and then allowed to bind to mouse IgG-alkaline 
phosphatase conjugate. After thorough washing, the alkaline phosphatase activity was assayed by 
catalyzed color development. In such semi-quantitative assays, the SpA-6xHis-oc:B fusion protein 
had about 30% IgG-binding activity as compared to commercially obtained protein A (Sigma 
Chemical Co.) with all five IgG-binding domains. 

Another SpA-phycocyanin a fusion protein was constructed to enhance the fusion 
protein's utility. This construct is similar to the SpA(DABC)-6xHis-TEV-CpcA construct 
described above, but has the 33-residue GCN4-pH trimerization domain inserted between the 
6xHis tag and the TEV site. This fusion protein, SpA(DABC)-6xHis-pH-TEV-CpcA, encoded by 
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plasmid pBS354, was found only in inclusion bodies when expressed in E. coll Unlike Ec349 ; 
about 10% of the Ec354 proteins affinity-purified from inclusion bodies was full-length 
[SpA(DABC)-6xHis-pII-TEV-CpcA], suggesting fast sequestering of the newly synthesized 
polypeptides into inclusion bodies due to bundling by the GCN4-pH domain. The majority of 
purified Ec354 proteins, however, were still missing some or all of the SpA IgG-binding 
domains, again illustrating protein A's susceptibility to E. coli proteases. 

When expressed in Anabaena sp., the fusion protein An354 was found assembled into 
phycobilisomes with no apparent destabilizing effect on the complex. Interestingly, cells 
expressing An354 had a more normal phenotype than those expressing An349 (see above). Ni2+- 
NTA-purified An354 proteins were obtained in reasonable yield, accounting for >7% of total 
cellular phycobiliproteins . 

On SEC-HPLC, the An354 protein ran as a single peak with an apparent molecular 
weight corresponding to a hexamer, [(SpA-6xHis-pII-a:fl)3]2. SDS-PAGE confirmed the 
composition as SpA-6xHis-pII-a:B, and showed that most of the SpA moiety was subject to 
proteolytic clipping at different hinge regions, giving full-length SpA(DABC)-6xHis-pH-a (53.2 
kDal; ca. 10%), SpA(ABC)-6xHis-pII-a (45.2 kDal; ca. 20%), SpA(BC)-6xHis-pH-a (39.2 kDal: 
ca. 50%), and 6xHis-pH-a (25.2 kDal; ca. 20%). The complete removal of the displayed SpA 
protein by cleavage in the domain C hinge is not seen with the An349 protein, and is likely a 
result of the SpA moiety in An354 being displayed farther away from the phycobilisome surface. 
As with An349, no proteolytic cleavage on domain B hinge [giving SpA(C)-6xHis-pH-a] was 
observed in Anabaena sp. Different fractions collected from the An354 SEC-HPLC peak had 
identical distributions of the fusion proteins missing various IgG-binding domains, consistent 
with the view that the fusion proteins are displayed on the phycobilisome as monomers (with 
respect to the GCN4-pn domain), and only form GCN4-pII-bundled stable trimers after cell lysis 
and phycobilisome dissociation. 

SDS-PAGE also showed that the carrier phycocyanin a domain had the normal level of 
phycocyanobilin. The SEC-HPLC hexamer [(SpA-6xHis-pII-a:6)3]2 fraction had spectroscopic 
properties very similar to those of (6xHis-pH-a:p)3 (see, e.g. Example C, supra), with an 
absorbance maximum at 621 nm, an A616 nm:A360 nm ratio of 7.5, fluorescence excitation 
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maximum at 621 nm (for 650 nm emission), and fluorescence emission maximum at 642 nm (for 
560 nm excitation). These favorable spectroscopic properties of such GCN4-pE-bundled stable 
trimers make the fusion protein a good fluorescent tag. ELISA also showed that An354 had IgG- 
binding affinity comparable to commercially obtained protein A. The increased IgG-binding 
5 activity of An354 in ELISA compared to An349 may be a result of more protein being 
immobilized on the 96-well plate, and/or cooperative binding of GCN4-pH-bundled SpA 
proteins. Such oligomerization-induced binding affinity increase has been observed (see e.g., 
Examples B and C, supra). 

4. Cvanoglobin fusion to phvcocvanin a or fi subunit . Cyanoglobin (GlbN) is a 15.7-kDal 

10 monomeric hemoprotein produced in the cyanobacterium Nostoc commune. This protein contains 
a non-cpvalently linked heme, and is well expressed as soluble protein in both E. coli (expression 
at 30°C and 37°C) m&Anabaena sp. when bearing a 24-residue N-terminal extension containing 
a 6xHis tag and the TEV recognition and cleavage site (recombinant protein encoded by plasmid 
pBS121; Example C, supra). However, recombinant cyanoglobin with C-terminal extension (the 

15 stop codon is replaced by a Ser codon) was found unable to fold normally in E. coli. 

Cyanoglobin-phycocyanin subunit fusion constructs, GlbN-6xHis-TEV-CpcA (encoded by 
plasmid pBS190) and GlbN-6xHis-TEV-CpcB (encoded by plasmid pBS274), gave relatively 
poor yield and were found mostly in inclusion bodies when expression was carried out at 37°C. 
At 30°C, the two fusion proteins had somewhat different behavior. Although both were still fast 

20 degraded in E. coli (thus giving low yield), the GlbN-HTa fusion protein (Eel 90) was produced 
in a substantial amount as soluble protein, with apparently normal amount of heme and an 
absorbance spectrum identical to that of 6xHis-tagged GlbN (Ecl21). The GlbN-HTb fusion 
protein (Ec274), on the other hand, was still found mostly in inclusion bodies, with the small 
amount of soluble proteins purified largely as apoproteins (without heme). These results indicate 

25 that the C-terminal extensions have slowed the folding of GlbN in £. coli, and the cyanoglobin 
domain in the GlbN-HTb fusion protein appears to be misfolded. 

Anabaena cultures expressing either fusion proteins were healthy, showing no negative 
phenotypes at all. Both fusion proteins, An 190 and An274, were found assembled into 
phycobilisomes with no destabilizing effect on the macrostructures, were purified in high yield 
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(each accounting for >20% of total cellular phycobiliproteins) with cognate partner subunits in 
1:1 ratio, and found to have spectroscopic properties very similar to native phycocyanin. 

The cyanoglobin moiety in the fusion proteins, however, behaved quite differently. Using 
phycocyanin a subunit as the carrier domain (An 190), the displayed GlbN domain was <30% 
5 holo when eluted off the Ni2+-NTA column, and lost almost all of the bound heme upon dialysis 
to remove imidazole. Using the phycocyanin p subunit as the carrier domain (An274), most of 
the displayed GlbN domain was purified with bound heme, and the heme remained bound to 
GlbN after dialysis. No degradation of the cyanoglobin domain was observed in either An 1 90 or 
An274 in Anabaena cells, suggesting that the cyanoglobin domain displayed on the N' of 

10 phycocyanin a subunit was misfolded but remained resistant to proteolysis. The misfolded GlbN 
domain appears to interfere with the carrier domain's formation of phycocyanin trimers: on SEC- 
HPLC, about 80% of Anl90 proteins ran as trimers, (GblN-HT <x:P)3, while the rest ran as 
monomers, GblN-HT a:P, in contrast to Anl68 which runs only as trimers, (HT a:P)3. Like 
An262, (a:HTB)3, An274 ran only as trimers, (a:GlbN-HTJi)3, on SEC-HPLC. 

15 In sum, both fusion proteins were quickly degraded in E. coli, but well displayed on 

phycobilisomes in Anabaena sp. Cyanoglobin displayed on the of phycocyanin 1} subunit seemed 
to fold better than on the N-terminus of the a subunit, and was able to retain larger amount of 
heme than the latter during purification. No degradation of the cyanoglobin domain was observed 
in either construct in Anabaena cells, suggesting that the cyanoglobin domain displayed on the 

20 N-terminus of phycocyanin a subunit was misfolded but remained resistant to proteolysis. 

5. Sperm whale myoglobin fusion to phycocyanin a or B subunit. The myoglobin from sperm 
whale (SWM6) is similar to cyanoglobin as a monomeric protein with non-covalently bound 
heme. SWMfi with a C-terminal extension of only six His residues (SWMp-6xHis; encoded by 
plasmid pBS 1 14) can be well expressed as soluble holoprotein in both E. coli and Anabaena sp. 

25 Recombinant SWMP proteins with larger C-terminal extensions, however, appear to be unable to 
fold. SWMP-6xHis-TEV-LacZ a (encoded by plasmid pBS261), SWM8-6xHis-TEV-CpcA 
(encoded by plasmid pBS271), and SWMb-6xHis-TEV-CpcB (encoded by plasmid pBS318) 
fusion proteins all gave poor yields when expressed in E. coli and were found nearly entirely in 
inclusion bodies. 
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Anabaena cultures expressing either An271 (SWMB-HTa) or An361 (SWMJ3-HTB) 
fusion proteins were healthy, and the fusion proteins were found assembled into phycobilisomes, 
and were purified with cognate partner phycocyanin subunits. Although myoglobin displayed on 
the N-terminus of phycocyanin B subunit (SWMB-HTfl) gave slightly better yield of the full 
5 length protein (with little amount of bound heme), both fusion proteins were purified mostly 
without the myoglobin moiety. This indicates that the displayed domain, if unable to fold, is 
degraded. 

6. Maltose-binding protein (MalE, MBP) fusion to phycocyanin a or fi subunit. The "mature" 
recombinant MalE protein (lacking the membrane-crossing signal peptide) with a C-terminal 

10 6xHis tag is well expressed as a soluble protein in the cytosol of both Anabaena sp. and E. coll. 
In the MalE-phycocyanin fusion constructs, the last three residues (Arg-Ile-Thr) of the mature 
MalE are replaced by the spacer Asn-Ser-Ser, and this modified MalE is fused to the N-terminus 
of a phycocyanin subunit via a 24-residue linker that bears a 6xHis tag and a recognition and 
cleavage site for the TEV protease. The MalE-6xHis-TEV-oc (Ec396) and MalE-6xHis-TEV-B 

15 (Ec398) fusion proteins are both expressed as soluble proteins in E. coli in relatively high yield. 

The strain Anabaena sp. PCC7120(pBS396) expressing fusion protein MalE-6xHis-TEV~ 

a had a phenotype very similar to that of strain Anabaena sp. PCC7120(pBS 168) expressing 

HTa. Sucrose density gradient sedimentation fractionation showed that the MalE-6xHis-TEV-a 

fusion protein was assembled into the phycobilisome without obvious effects on the 

2+ 

20 macromolecular complex. Ni -NTA affinity-purified protein, An396, was in relatively good 
yield, accounting for >10% of total cellular phycobiliproteins. Over 95% of the An396 proteins 
can bind to, and be specifically eluted from amylose columns, diagnostic of functional MalE. 

SDS-PAGE showed that the An396 protein has the composition MalE-6xHis-TEV-a:fl, 
and Zn2+-induced fluorescence indicates full chromophorylation of phycocyanin carrier domain. 

25 SEC-HPLC showed that An396 protein was monomeric, (MalE-6xHis-TEV-cc:fl) a behavior 

differing from that of HTa.B, whose elution volume is that of a trimer. The MalE-6xHis-TEV- 
a:J5 protein in the SEC-HPLC monomer peak fraction, however, has spectroscopic properties 
very similar to the (HTa:B)3 trimer, with absorbance maximum at 617 nm, fluorescence 
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excitation maximum at 618 nm (for 650 nm emission), fluorescence emission max. at 637 nm 
(for 560 nm excitation), and a fluorescence quantum yield of 0.23. 

Upon digestion with the TEV protease, over 90% of purified MalE-6xHis-TEV-a:B 
protein can be clipped at the TEV recognition site to separate the displayed domain (MalE) from 
5 the phycocyanin carrier domain. 

The fusion protein MalE-6xHis-TEV-B is expressed in Anabaena sp. at a level (>7.5% of 
total cellular phycobiliproteins) comparable to that of MalE-6xHis-TEV-oc (An396). Over 95% 
of Ni2+-NTA-purified An398 protein can bind to, and then be specifically eluted from, the 
amylose resin, indicative of the MalE function. However, analysis of An398 by SDS-PAGE 

10 shows that the protein preparation consists mostly of MalE-6xHis-TEV-B with only a very small 
amount of the partner phycocyanin a subunit. Zn2+-induced fluorescence suggests that the 
MalE-6xHis-TEV-B protein has a full complement of phycocyanobilin, i. e., two PCBs per B 
subunit domain. On SEC-HPLC most of the preparation runs as (MalE-6xHis-TEV-fl)2 
homodimer, with a slight shoulder corresponding to the (oc:MalE-6xHis-TEV-B) monomer. Like 

15 MalE-6xHis-TEV-a:B (An396), virtually all of the MalE-6xHis-TEV-B fusion proteins can be 
cleaved at the TEV site to separate the displayed protein (MalE) and the carrier protein 
(phycocyanin B subunit). 

The MalE-6xHis-TEV-B protein is found to be assembled into phycobilisomes without 
obvious deleterious effect on the latter structures. Since assembly of the fusion protein into the 

20 phycobilisomes requires association with the phycocyanin a subunit, this result indicates that the 
MalE-6xHis-TEV-B fusion protein has a reduced affinity to the a subunit which is likely lost 
during the purification process. In the absence of a subunits, phycocyanin B subunits have been 
shown \o form stable homodimers (see, e.g. Example A, supra). Such interference by the 
displayed domain on the phycocyanin subunit carrier domain's affinity for its cognate partner 

25 subunit is likely fusion protein specific - cyanoglobin displayed on the N-terminus of the 

phycocyanin B subunit does not appear to have obvious effects on the fusion protein's affinity for 
the phycocyanin a subunit. 

The (MalE-6xHis-TEV-B)2 homodimer peak fraction in SEC-HPLC was used for 
spectroscopic characterization. Similar to the (HTB)2 homodimer protein, the (MalE-6xHis- 
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TEV-B)2 homodimer has absorbance and fluorescence excitation maxima at 605 nm, a 
fluorescence emission maximum at 638 nm, and a fluorescence quantum yield of 0.26. 

To obtain 1:1 oc:B stoichiometry, the B-Ll 1-oc subunit-fusion phycocyanin (Example B, 
supra) can be used, in place of the B subunit, as a carrier protein. Constructs incorporating the 
GCN4-pH trimerization domain, such as (MalE-6xHis-pII-a:B)3 and (MalE-6xHis-pII-B-Li 1- 
a)3, have excellent spectroscopic properties, and are very useful, for instance, in studies of 
protein glycosylation. 

All publications and patent applications cited in this specification and all references cited 
therein are herein incorporated by reference as if each individual publication or patent application 
or reference were specifically and individually indicated to be incorporated by reference. 
Although the foregoing invention has been described in some detail by way of illustration and 
example for purposes of clarity of understanding, it will be readily apparent to those of ordinary 
skill in the art in light of the teachings of this invention that certain changes and modifications 
may be made thereto without departing from the spirit or scope of the appended claims. 
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